Provide versioning implementation guidance #348

rosy1280 · 2019-05-10T17:46:44Z

There seems to be confusion in the community about how one would implement versioning (see the multiple discussions about pessimistic locking). Can we provide recommendations on how to implement versioning, something like a reference implementation, maybe using Stanford as an example?

ahankinson · 2019-05-10T17:49:14Z

Perhaps a change in title for this issue to "Provide versioning implementation guidance" would be clearer? I initially thought this was about spec versioning

julianmorley · 2019-05-10T18:46:31Z

+1. There's still considerable confusion about the line between an access-based repository for daily work (cf. Fedora access patterns) and what preserving those assets long-term looks like.

zimeon · 2019-05-10T19:30:58Z

To me the key point is that when to version is a curation choice, and we might provide implementation advice on making that choice. From the spec point of view I think that versions "just are", but there might still be a link off to the guidelines for advice.

ahankinson · 2019-05-10T19:36:33Z

The sense I get is that it's the nuts-and-bolts of an OCFL client implementation that is the stumbling block: Where to store 'staged' files, how to deal with concurrent writes (use file-based semaphores, or implement gatekeeper software), and the order-of-operations for writing content in a versioning action.
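As an illustration of the file-based semaphore idea mentioned above, here is a minimal Python sketch. The lock file name `.lock` and the function `object_lock` are hypothetical, not taken from OCFL or any client. It relies on exclusive file creation on a local filesystem; that guarantee may not hold on network file systems or object stores.

```python
import os
from contextlib import contextmanager


@contextmanager
def object_lock(object_root, lock_name=".lock"):
    """Hold an exclusive lock file inside object_root for the duration of the block.

    Hypothetical helper: the lock file name is an assumption for illustration.
    """
    lock_path = os.path.join(object_root, lock_name)
    # O_CREAT | O_EXCL makes creation fail with FileExistsError if the file
    # already exists, so only one writer at a time can take the lock.
    fd = os.open(lock_path, os.O_CREAT | os.O_EXCL | os.O_WRONLY)
    try:
        # Record the owner's process id to help a human diagnose stale locks.
        os.write(fd, str(os.getpid()).encode())
        yield lock_path
    finally:
        os.close(fd)
        os.remove(lock_path)
```

A second writer that tries `with object_lock(root):` while the lock is held gets a `FileExistsError` and can back off or report the conflict. A lock left behind by a crashed client has to be cleared by some other policy, which this sketch does not cover.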

rosy1280 · 2019-05-10T19:57:45Z

I think all of the above is true. Should we break this up into smaller parts?

birkland · 2019-05-10T20:27:00Z

To expand on questions related to the nuts and bolts:

Using a particular approach for writing a new version on {a direct fs, NFS, S3/cloud}, how does one detect and recover if the client fails at a given step?
If two un-coordinated clients happen to accidentally write a new version to an object, what happens? Do any approaches on {a direct fs, NFS, S3/cloud} allow data thought to be safely written, to be silently overwritten or corrupted? Can the spec help with this?
Some of the conversation also appeared to cover how an application manages the state of an object before it is committed to OCFL, e.g. in staging, possibly with multiple writers. (this may be out of scope even for implementation notes)
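For the un-coordinated writers question, one possible approach on a direct filesystem is to claim the version directory with a call that fails if the directory already exists, so that the second writer finds out instead of overwriting. The sketch below is only an illustration under that assumption; `claim_version_dir` is a hypothetical name, and it says nothing about NFS or S3/cloud stores, where the semantics differ.

```python
import os


def claim_version_dir(object_root, version):
    """Try to create object_root/version; return its path, or None if it is taken.

    Hypothetical helper for illustration. os.mkdir raises FileExistsError when
    the directory already exists, so two local writers cannot both succeed.
    """
    path = os.path.join(object_root, version)
    try:
        os.mkdir(path)
    except FileExistsError:
        # Another writer got there first: do not touch its content.
        return None
    return path
```

A caller that receives `None` knows another client has already started that version and can abort rather than write into it.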

neilsjefferies · 2019-05-22T11:27:56Z

These are implementation details, not part of the OCFL spec. So by all means we can add some stuff to the implementation notes - it's not like this is rocket science.

1. If we want to update and a temp version directory exists, then abort.
2. Create a temporary new version directory and do all operations there.
3. The manifest still points to the last valid version, so all reads work OK.
4. After all updates are done, rename the temp directory to its version directory name.
5. Create a new root manifest with a temp name (copy from the version directory).
6. Finally, lock the object and update the manifest by deleting the old one and renaming the new one - this should not take very long!
7. Update the manifest checksum and unlock.
To create the new version you probably read the old version so it should be in your Varnish cache anyway - you do have one for reads don't you?
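The sequence above can be sketched in Python roughly as follows. This is a simplified illustration, not a reference implementation: the file names `manifest.json` and `manifest.json.sha512`, the `.new` suffix, and the function names are all assumptions, locking is left out, and `os.replace` is used to swap the manifest in one call instead of a separate delete and rename.

```python
import hashlib
import json
import os
import shutil

TEMP_DIR = "deposit"          # temp directory name, as floated in the thread
MANIFEST = "manifest.json"    # hypothetical file name for the manifest
SIDECAR = MANIFEST + ".sha512"  # hypothetical file name for its checksum


def sha512_of(path):
    """Return the hex SHA-512 digest of the file at path."""
    with open(path, "rb") as f:
        return hashlib.sha512(f.read()).hexdigest()


def write_version(object_root, version, files, manifest):
    """Write a new version. files maps flat file names to bytes; manifest is a dict."""
    temp = os.path.join(object_root, TEMP_DIR)
    # Abort if a temp version directory already exists.
    if os.path.exists(temp):
        raise RuntimeError("temp version directory already exists")
    # Do all work in the temp directory; readers still see the old manifest.
    os.mkdir(temp)
    for name, data in files.items():
        with open(os.path.join(temp, name), "wb") as f:
            f.write(data)
    with open(os.path.join(temp, MANIFEST), "w") as f:
        json.dump(manifest, f)
    # Rename the temp directory to its version directory name.
    version_dir = os.path.join(object_root, version)
    os.rename(temp, version_dir)
    # Copy the version manifest to the object root under a temp name.
    new_root = os.path.join(object_root, MANIFEST + ".new")
    shutil.copyfile(os.path.join(version_dir, MANIFEST), new_root)
    # Swap in the new root manifest, then refresh its checksum.
    # (Locking around these two steps is omitted in this sketch.)
    os.replace(new_root, os.path.join(object_root, MANIFEST))
    with open(os.path.join(object_root, SIDECAR), "w") as f:
        f.write(sha512_of(os.path.join(object_root, MANIFEST)))
    return version_dir
```

For example, `write_version(root, "v1", {"a.txt": b"hello"}, {"head": "v1"})` leaves a `v1` directory, a root manifest, and a checksum file, with no `deposit` directory remaining.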

Clean up after failure

Delete any temp directories - this automatically reverts to last good version
If the manifest checksum fails then you will find a new version dir not in the manifest, goto 5 above
You are rechecksumming everything anyway given what just happened?
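A rough Python sketch of the clean-up pass described above, under stated assumptions: temp directories start with `deposit`, and the manifest and checksum files are called `manifest.json` and `manifest.json.sha512` (hypothetical names, with the checksum file holding only the hex digest). It removes leftover temp directories and reports whether the root manifest still matches its checksum; deciding what to do on a mismatch is left to the caller.

```python
import hashlib
import os
import shutil


def clean_up(object_root, temp_prefix="deposit",
             manifest="manifest.json", sidecar="manifest.json.sha512"):
    """Delete leftover temp directories and verify the root manifest checksum.

    Returns (sorted list of removed directory names, True if checksum matches).
    All file and directory names here are assumptions for illustration.
    """
    removed = []
    for name in os.listdir(object_root):
        path = os.path.join(object_root, name)
        # Only directories whose name starts with the temp prefix are removed.
        if os.path.isdir(path) and name.startswith(temp_prefix):
            shutil.rmtree(path)
            removed.append(name)
    with open(os.path.join(object_root, manifest), "rb") as f:
        actual = hashlib.sha512(f.read()).hexdigest()
    with open(os.path.join(object_root, sidecar)) as f:
        expected = f.read().strip()
    return sorted(removed), actual == expected
```

If the second return value is `False`, the object is in the state the comment describes, where a version directory may exist that the root manifest does not yet reflect.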

...temp directory name.. I dunno... "deposit" is as good as any,
...or "deposit_<transaction etag>" to help clean up any other temp DB's you might have hanging around

awoods · 2019-05-29T15:14:19Z

Thanks, @neilsjefferies . This looks like a good start towards an "Implementation Notes" pull-request.

rosy1280 added this to the 1.0 milestone May 10, 2019

rosy1280 added Examples Needs Discussion and removed Examples labels May 10, 2019

rosy1280 changed the title ~~Provide versioning recommendations~~ Provide versioning implementation guidance May 10, 2019

neilsjefferies mentioned this issue Jun 4, 2019

Guidance on versioning implementation 1 #362

Merged

rosy1280 assigned neilsjefferies Jul 16, 2019

rosy1280 closed this as completed in #362 Aug 27, 2019
