This repository has been archived by the owner on Feb 12, 2024. It is now read-only.

Create an example for configuring the ipfs repo #1303

Merged 7 commits from docs/repo-example into master on Apr 30, 2018

Conversation

@jacobheun (Contributor)

Also demonstrates how to customize the repo lock, which is a new addition

depends on ipfs/js-ipfs-repo#162

// Initialize our repo and IPFS node
const myRepo = new Repo('/tmp/custom-repo/.ipfs', customRepositoryOptions)
const node = new IPFS({
  repo: myRepo
})
Member

@jacobheun I think it makes sense to remove the step of requiring 'ipfs-repo' and setting it up manually, by just adding a repoOptions object to the IPFS constructor options that lets us pass the storage backends directly.

@daviddias (Member)

@Mr0grog mind reviewing this tutorial?

@daviddias (Member) commented Apr 9, 2018

@kumavis @hermanjunge with this addition, a user should be able to plug the Parity-to-IPFS bridge in pretty easily. @vmx, you have experience shipping npm modules that pull Rust code, compile it, and expose bindings to it. Can you help us get the Parity bridge in a nice package?

Parity to IPFS issue #763

@jacobheun (Contributor, Author)

@diasdavid I updated the IPFS constructor to accept repoOptions and the example is updated to leverage that. Let me know if this is what you were thinking; I think it's cleaner.

@@ -0,0 +1,24 @@
# Customizing the IPFS Repo

> This example shows you how to customize your repository, including where your data is stored and how the repo locking is managed.
Contributor

Why is this formatted as a quote?

It might be helpful to provide some short examples of why or when you’d want to do this, for example:

  • If you want to store data somewhere that’s not on your local disk, like S3, a Redis instance, a different machine on your local network, or in your own database system, like MongoDB or Postgres, you might use a custom datastore.

  • If you have multiple browser windows or workers sharing the same IPFS storage, you might want to use a custom lock to coordinate between them. (Locking is used to ensure only one IPFS instance can access a repo at a time.)

I think the parenthetical on that last one is important (maybe it should be called out separately?), since otherwise you have to do some digging to see what operations hold locks — the only one is repo.open() as far as I can tell.

Since you’ve done some thinking here, you might have some better ideas or more concrete examples you could list :)
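To make the locking discussion above concrete, here is a minimal in-memory lock written along the lines the thread describes (lock/locked with Node-style callbacks). This is an illustrative sketch under those assumptions, not js-ipfs-repo's actual implementation.

```javascript
// Sketch of a repo lock: one holder per repo directory at a time.
// The lock/locked callback shapes are assumptions based on this thread,
// not verbatim library code.
class MemoryLock {
  constructor () {
    this.held = new Set()
  }

  // Acquire the lock for a repo directory; on success, calls back with a
  // "closer" object whose close() releases the lock.
  lock (dir, callback) {
    this.locked(dir, (err, isLocked) => {
      if (err) return callback(err)
      if (isLocked) return callback(new Error('Repo is locked'))
      this.held.add(dir)
      callback(null, {
        close: (cb) => {
          this.held.delete(dir)
          cb(null)
        }
      })
    })
  }

  // Report whether the lock for `dir` is currently held.
  locked (dir, callback) {
    callback(null, this.held.has(dir))
  }
}

// Usage: a second acquire on the same directory fails until close() runs
const lock = new MemoryLock()
lock.lock('/tmp/repo-a', (err, closer) => {
  lock.lock('/tmp/repo-a', (err2) => {
    console.log(err2.message) // prints "Repo is locked"
    closer.close(() => {
      lock.locked('/tmp/repo-a', (e, isLocked) => {
        console.log('locked after close:', isLocked) // prints "locked after close: false"
      })
    })
  })
})
```

The same shape extends naturally to browser coordination (e.g. backing the Set with a shared store) or to remote locks like the S3 one discussed later in this thread.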

Contributor

It would also be helpful to say a little bit here about how to set those things and link to those lines in the index.js file. Linking to the relevant docs at https://github.com/ipfs/js-ipfs-repo might be good here, too.

Contributor (Author)

Yes, I think some examples of why would be very helpful, I'll get some added along with the ones you've mentioned as I think they are both likely to be common scenarios.

Also, your feedback is great and really appreciated, thank you.

Contributor

No problem! Glad it was helpful :)

## Other Options

### Custom Repo Lock
> This example sets the repo locker to `false`, preventing any locking from happening. If you would like to control how locking happens, such as with a centralized S3 ipfs repo, you can pass in your own custom locker. See [custom-locker.js](./custom-locker.js) for an example of a custom locker that can be used for [datastore-s3](https://github.com/ipfs/js-datastore-s3).
Contributor

Why is this formatted as a quote?

I know the internals of js-ipfs-repo use the term “locker,” but from the perspective of a native English speaker (and typical programming terminology), this is weird wording. The thing that locks and unlocks is the “lock,” while the thing that holds stuff and has a built-in lock is the “locker” (that would be the whole repo in this case). I feel like this would be clearer as custom-lock.js and class S3Lock.


const PATH = require('path')

class S3Locker {
Contributor

I think it would be helpful to have a class-level description of what this whole thing is doing. Maybe something like this?

Uses an object in an S3 bucket as a lock to signal that an IPFS repo is in use. When the object exists, the repo is in use. You would normally use this to make sure multiple IPFS nodes don’t use the same S3 bucket as a datastore at the same time.

lock (dir, callback) {
this.lockPath = this.getLockfilePath(dir)

this.s3.put(this.lockPath, Buffer.from(''), (err, data) => {
Contributor

Shouldn’t this call this.locked(dir, (error, inUse) => ...) first and call callback(new Error('Repo is locked')) (or some similar message) if the lock is already held by someone else?

It looks like the memory lock in js-ipfs-repo doesn’t do this, which I think is a bug (@diasdavid / @dignifiedquire ?) — the fs lock does (via the underlying lock-me library).

* @returns {void}
*/
lock (dir, callback) {
this.lockPath = this.getLockfilePath(dir)
Contributor

@diasdavid or @dignifiedquire might be able to say more on this, but it seems like this is contrary to the existing lock API, where you ought to be able to request two different locks simultaneously:

MyLock.lock('first-directory', (error, firstLock) => {
  MyLock.lock('second-directory', (error, secondLock) => {
    // do some stuff
    secondLock.close()
    firstLock.close()
  })
})

Contributor (Author)

This is a good catch. While the system currently only manages a single lock file, the existing APIs in memory and fs don't prohibit multiple locks. I'll get this and the locked call added.


/**
* Storage Backends are fully customizable. Each backend can be stored in separate services,
* or in a single service. Options can be passed into the datastores via the storageBackendOptions
Contributor

It’s not really the backend that’s stored in a service, right? Maybe something like this would be a little clearer?

IPFS nodes store different information in separate storageBackends, or datastores. Each storage backend can use the same type of datastore or a different one — you could store your keys in a levelDB database while everything else is in files, for example. (See https://github.com/ipfs/interface-datastore for more about datastores.)
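For readers following along, a mixed-backend configuration along those lines might look like the sketch below. The datastore module names are assumptions drawn from the js-ipfs ecosystem of this era, not verbatim from this PR's diff.

```javascript
// Hypothetical repo options mixing datastore implementations: keys and
// blocks on the filesystem, everything else in a LevelDB database.
const repoOptions = {
  storageBackends: {
    root: require('datastore-fs'),         // version and config data
    blocks: require('datastore-fs'),       // the raw IPFS block data
    keys: require('datastore-fs'),         // keychain data
    datastore: require('datastore-level')  // LevelDB store for the rest
  },
  storageBackendOptions: {
    blocks: { sharding: false }            // per-backend options go here
  }
}
```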

* property shown below.
*/
storageBackends: {
root: require('datastore-fs'), // version and config data will be saved here
Contributor

This little description is super helpful. Maybe you could add something similar for the rest of these? Alternatively, maybe you could add some of this info to the docs for js-ipfs-repo.

}
},

// false will disable locking, you can also pass in a custom locker
Contributor

This comma should be a semicolon.

console.log('\nAdded file:', filesAdded[0].path, filesAdded[0].hash)
fileMultihash = filesAdded[0].hash
cb()
}),
Contributor

If this were split into steps and you used async.seq instead of async.series:

(cb) => node.version(cb),
(version, cb) => {
  console.log('Version:', version.version)
  cb()
},
(cb) => node.files.add({
  path: 'test-data.txt',
  content: Buffer.from('We are using a customized repo!')
}, cb),
(filesAdded, cb) => {
  console.log('\nAdded file:', filesAdded[0].path, filesAdded[0].hash)
  cb(null, filesAdded[0].hash)
},
(fileMultihash, cb) => node.files.cat(fileMultihash, cb),
// etc.

You could get rid of the error handling logic so it reads a bit clearer for people trying to follow it. This would remove the need for the fileMultihash global, too.

Alternatively, using the promise-based return values would let you do this without a special dependency:

node.on('ready', () => {
  node.version()
    .then(version => console.log('Version:', version.version))
    .then(() => node.files.add({
      path: 'test-data.txt',
      content: Buffer.from('We are using a customized repo!')
    }))
    .then(filesAdded => {
      console.log('\nAdded file:', filesAdded[0].path, filesAdded[0].hash)
      return filesAdded[0].hash
    })
    .then(fileMultihash => node.files.cat(fileMultihash))
    // etc.
})

Contributor

It might also be good to catch any errors at the end whether you use series, seq, or promises, just to demonstrate good practice.
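A runnable sketch of that trailing-catch pattern, using a stubbed node object so the chain runs without IPFS installed (the stub, including its version string, is an assumption for illustration, not the js-ipfs API):

```javascript
// Stub standing in for an IPFS node so this snippet runs anywhere.
const node = {
  version: () => Promise.resolve({ version: '0.28.0' }),
  stop: () => Promise.resolve()
}

node.version()
  .then(version => console.log('Version:', version.version))
  // A trailing catch keeps one failed step from crashing the process
  .catch(err => console.log('File Processing Error:', err))
  // This runs whether or not an error occurred above
  .then(() => node.stop())
  .then(() => console.log('Node stopped'))
```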

Contributor (Author)

I think the promise structure is much easier to consume and that's a great point about the error catching. I'll include those changes in the next update I push.

@@ -43,7 +43,7 @@ class IPFS extends EventEmitter {

if (typeof options.repo === 'string' ||
options.repo === undefined) {
this._repo = defaultRepo(options.repo)
this._repo = defaultRepo(options.repo, options.repoPath)
@Mr0grog (Contributor) commented Apr 12, 2018

This is a typo, right? In the examples, you set options.repoOptions.

If you are going to change the API by adding a new option, would you please document it with the rest of the constructor options in the README?

Personally, I’m also a little unsure about the value of this. It’s basically just replacing:

new IPFS({
  repo: new Repo('/some/path', {options})
})

with

new IPFS({
  repo: '/some/path',
  repoOptions: {options}
})

You don’t have to explicitly import ipfs-repo anymore to get a fully customizable repo, but the whole behavior is more complicated as a result (repoOptions won’t do anything if you don’t also set repo or if you set repo to an actual repo instance). There are also now multiple ways to do the same repo customization. I think this makes the API a little less clear.

That said, I’m more here to focus on docs, so I’ll try not to push or say any more on this.

Contributor (Author)

This is a really good point and I agree. The system has a lot of complexity, so avoiding more options in the API is a good thing in my opinion. I think it also helps delineate using the default repo, versus a custom repo, more clearly.

console.log('\nFetched file content:')
process.stdout.write(data)
cb()
})
Contributor

I wasn’t thinking about this yesterday, but it might also be useful to inspect the filesystem after all these operations and demonstrate that the custom repo options were used. You’d probably want to set options at the top here that aren’t the same as the defaults so their effects are visible, e.g. extension: '.customblocks' or something.

@jacobheun (Contributor, Author)

@Mr0grog I've incorporated your feedback and I've included the latest lock changes from ipfs/js-ipfs-repo#162.

@Mr0grog (Contributor) left a comment

Looks great, except for one part that didn’t get updated with the rest! (And one other minor note inline that’s not as big a deal.)

## Other Options

### Custom Repo Lock
This example sets the repo locker to `false`, preventing any locking from happening. If you would like to control how locking happens, such as with a centralized S3 IPFS Repo, you can pass in your own custom lock. See [custom-lock.js](./custom-lock.js) for an example of a custom lock that can be used for [datastore-s3](https://github.com/ipfs/js-datastore-s3).
Contributor

This feature (no lock at all) got removed from your repo PR, right? I think the first sentence here needs updating ;)

Contributor (Author)

@Mr0grog good catch! I've pushed up this change along with the node.stop call.

// Log out the error, if there is one
.catch((err) => {
console.log('File Processing Error:', err)
})
Contributor

Might be nice to have a final .then(node.stop) that stops the IPFS node so the user doesn’t need to ctrl+c to kill the process.

@jacobheun (Contributor, Author)

@Mr0grog when you have some time can you check the final adjustments I made per your feedback? Thanks!

console.log('\nFetched file content:')
process.stdout.write(data)
console.log('\n\nStopping the node')
return node.stop()
Contributor

Minor technical nit: if you put this after the catch, then it will still run if there was an error, which is probably desirable:

.then((data) => {
  console.log('\nFetched file content:')
  process.stdout.write(data)
})
// Log out the error, if there is one
.catch((error) => {
  console.log('File Processing Error:', error)
})
.then(() => {
  console.log('\n\nStopping the node')
  return node.stop()
})

Otherwise, this all looks lovely! 👍 👍 👍

Contributor (Author)

Good point, I've squashed the change into the previous commit and added a comment about not needing to catch errors on the stop.

},
blocks: {
sharding: false, // Used by IPFSRepo Blockstore to determine sharding; Ignored by datastore-fs
extension: '.ipfsblock',
Member

It's currently just `.data`; let's avoid confusing users.

createIfMissing: true
},
keys: {
extension: '.ipfskey',
Member

Does the keys repo use an extension?

*/
storageBackendOptions: {
root: {
extension: '.ipfsroot', // Used by datastore-fs; Appended to all files
Member

No extension here. Check:

> ls ~/.jsipfs
blocks    config    datastore keys      repo.lock version

createIfMissing: true
},
datastore: {
extension: '.ipfsds',
Member

No extension used by default

Contributor (Author)

The reason I added the non-default options was to demonstrate the configuration options to users. Perhaps I could add some comments about the defaults here and mention that they are being overridden for demonstration purposes only?

Contributor

Yeah, it would probably be useful to call that out here.

It might also be good to, in the last step of the script, do something like:

.then(() => {
  console.log('Check "/tmp/custom-repo/.ipfs" to see what your customized repository looks like on disk.')
})

@daviddias (Member) left a comment

It's missing how to plug in the S3 backend

@jacobheun (Contributor, Author)

@diasdavid js-datastore-s3 has a working example of how to use S3 as a full backend, and we reference that in the comments in this example. I think it would be good to avoid duplicating that example so it doesn't go out of date as the project evolves, but maybe I can call out that example more prominently in the README?

@daviddias (Member)

@jacobheun please do, it was not evident to me and it is one of the main reasons on why to upgrade all repos are configured

remove ability to add repoOptions in favor of just supplying a repo

chore: remove outdated options
automatically stop the node in the custom repo example

docs: update custom repo example
chore: bump custom repo example ipfs-repo version
@Mr0grog (Contributor) commented Apr 23, 2018

[the S3 repo] is one of the main reasons on why to upgrade all repos are configured

I don’t think that’s correct — it’s not even used in the example at all. The point of changing all the repo configurations is to make it obvious that what you did affected the output on disk. You can’t see the results if you just set everything to their defaults.

fix: resolve bugs in the custom s3 lock
@jacobheun (Contributor, Author)

I've added in some additional comments and updated the readme to try and spell out the differences with the defaults more clearly.

I also updated the full s3 example in the datastore-s3 repo, ipfs/js-datastore-s3#5, to use the S3Lock instead of memory, and fixed a bug with that lock here. That new update has been tested against one of my S3 buckets.

@daviddias (Member)

@Mr0grog what I meant was that this endeavor, "upgrading the ipfs-repo module so that it supports custom locking and update js-ipfs to support custom repos" was started because we wanted to enable users to be able to use an S3 bucket as the storage backend.

Most users won't really need to touch the repo if they are just going to use regular fs. Most users, if not all, will trust the defaults. What is really interesting is to have multiple storage backends, from S3 (requested multiple times by IPFS users) and things like #763.

@daviddias (Member) left a comment

LGTM. @Mr0grog any last remarks?

@Mr0grog (Contributor) commented Apr 24, 2018

what I meant was that this endeavor, upgrading the ipfs-repo module

Ah, sorry, I misunderstood the context here.

Most users won't really need to touch the repo if they are just going to use regular fs. Most users, if not all, will trust the defaults. What is really interesting is to have multiple storage backends

100% agree with all that! I assumed the intent here was not to demo switching the repo implementation (to S3 or whatever) because it wasn't actually included in the example. So in the absence of that, it seemed important to do something that had an observable effect.

Anyway, looks good to me.

@daviddias mentioned this pull request Apr 30, 2018
@daviddias merged commit 61e7f86 into master Apr 30, 2018
@ghost removed the status/in-progress label Apr 30, 2018
@daviddias deleted the docs/repo-example branch April 30, 2018 14:36