add support for batch puts #38

hugomrdias · 2019-10-10T13:21:24Z

This PR adds support for batch puts of blocks into the datastore.

I duplicated the buildFile function with support for batch to be easier to benchmark, we can dedupe before merging this PR.

Two new options were added batch (Boolean) and batchInterval (number).
batch is just a toggle to enable batching
batchInterval is the time interval to accumulate Blocks before putting in the datastore

Batching improves performance and below you will see some numbers, but please run the benchmark files i added to the repo and test in your machines and provide feedback.

browser

    LOG: 'without batch x 1.25 ops/sec ±4.67% (10 runs sampled)'
    LOG: 'batch 50ms  x 1.81 ops/sec ±3.91% (13 runs sampled)'
    LOG: 'batch 100ms  x 1.42 ops/sec ±8.69% (12 runs sampled)'
    LOG: 'batch 150ms  x 1.26 ops/sec ±1.73% (11 runs sampled)'
    LOG: 'batch 200mb  x 1.12 ops/sec ±2.47% (10 runs sampled)'
    LOG: 'Fastest is batch 50ms '
        ✔ benchmark 10mb
    LOG: 'without batch x 0.22 ops/sec ±0.96% (6 runs sampled)'
    LOG: 'batch 50ms  x 0.42 ops/sec ±4.70% (7 runs sampled)'
    LOG: 'batch 100ms  x 0.35 ops/sec ±7.86% (6 runs sampled)'
    LOG: 'batch 150ms  x 0.33 ops/sec ±3.07% (6 runs sampled)'
    LOG: 'batch 200mb  x 0.31 ops/sec ±8.05% (6 runs sampled)'
    LOG: 'Fastest is batch 50ms '
        ✔ benchmark 50mb
    LOG: 'without batch x 0.08 ops/sec ±2.83% (5 runs sampled)'
    LOG: 'batch 50ms  x 0.17 ops/sec ±9.25% (5 runs sampled)'
    LOG: 'batch 100ms  x 0.16 ops/sec ±6.47% (5 runs sampled)'
    LOG: 'batch 150ms  x 0.15 ops/sec ±2.78% (5 runs sampled)'
    LOG: 'batch 200mb  x 0.14 ops/sec ±4.76% (5 runs sampled)'
    LOG: 'Fastest is batch 50ms ,batch 100ms '
        ✔ benchmark 100mb
    
    ✓ single run 500mb without batch (70100ms)
    ✓ single run 500mb with batch 50ms (25559ms) **63.6% Faster**
    ✓ single run 500mb with batch 100ms (29315ms)

node

    ✓ single run 500mb without batch (22605ms)
    ✓ single run 500mb with batch 50ms (5847ms) **74.3% Faster**
    ✓ single run 500mb with batch 100ms (16180ms)
    ✓ single run 500mb with batch 150ms (11050ms)
    ✓ single run 500mb with batch 200ms (6520ms)
    
    without batch x 1.74 ops/sec ±3.75% (13 runs sampled)
    batch 50ms  x 2.87 ops/sec ±3.72% (18 runs sampled)
    batch 100ms  x 2.41 ops/sec ±1.20% (16 runs sampled)
    Fastest is batch 50ms 
        ✓ benchmark 10mb (21379ms)
    without batch x 0.38 ops/sec ±1.79% (6 runs sampled)
    batch 50ms  x 0.77 ops/sec ±5.00% (8 runs sampled)
    batch 100ms  x 0.75 ops/sec ±3.56% (8 runs sampled)
    Fastest is batch 50ms ,batch 100ms 
        ✓ benchmark 50mb (37115ms)
    without batch x 0.21 ops/sec ±1.52% (6 runs sampled)
    batch 50ms  x 0.43 ops/sec ±2.39% (7 runs sampled)
    batch 100ms  x 0.42 ops/sec ±0.70% (7 runs sampled)
    Fastest is batch 50ms 
        ✓ benchmark 100mb (61898ms)
    without batch x 0.10 ops/sec ±4.39% (5 runs sampled)
    batch 50ms  x 0.21 ops/sec ±2.84% (6 runs sampled)
    batch 100ms  x 0.21 ops/sec ±6.17% (6 runs sampled)
    Fastest is batch 50ms ,batch 100ms 
        ✓ benchmark 200mb (106736ms)

Needs:
ipld/js-ipld#238

daviddias · 2019-10-10T14:54:43Z

Does it know how not to choke itself? (i.e. trying to batch 10GB at the same time?)

achingbrain · 2019-10-10T14:59:59Z

This is great but I can't help but think it doesn't belong at this layer in the stack, better in ipld or the blockstore itself?

hugomrdias · 2019-10-10T15:02:15Z

Does it know how not to choke itself? (i.e. trying to batch 10GB at the same time?)

it should never batch large amounts of data unless the user changes the chunker, even then it would be the same as without batching.
the default interval in this PR is 50ms so it will accumulate small blocks (according to the chunker) for 50ms and batch put in one single transaction.
also the timer for batching adjusts itself to not choke even if a single transaction takes more than the interval.

hugomrdias · 2019-10-10T15:05:03Z

This is great but I can't help but think it doesn't belong at this layer in the stack, better in ipld or the blockstore itself?

i know what you mean and agree, i tried to make it like take but with the current code this is the only place we can do it without adding complexity.

if we extract blockstore to unixfs it would be much easier.

vmx · 2019-10-10T15:21:12Z

if we extract blockstore to unixfs it would be much easier.

After a quick look it seems that js-ipld is only used for get() and put(). Those two operations could easily be done without js-ipld directly on the BlockService. I would actually prefer that, as I don't like the js-ipld changes that much, as it leaks abstractions. To quote from the js-ipld README:

The IPLD API works strictly with CIDs and deserialized IPLD Nodes. Interacting with the binary data happens on lower levels.

Adding putBatch() to js-ipld would break that abstraction, as we would directly put binary data.

Also I don't see much benefit in using js-ipld for unixfs. One of the points of js-ipld is, that it is codec agnostic and can work with any provided one and would do the right thing. In unixfs we exactly know which codec it is, and it is always only DagPB. So I would be in favour of using the BlockService directly.

achingbrain · 2019-10-10T15:30:03Z

I think it's important to have boundaries and defined responsibilities in a system. If we have everything depend on everything else it becomes a mess.

I would like this module to be responsible for creating dags from streams of buffers, I would like other modules to be responsible for serializing and persisting nodes and they are then free to apply whatever storage optimisations they see fit, including batching writes.

hugomrdias · 2019-10-10T15:38:19Z

I would like this module to be responsible for creating dags from streams of buffers, I would like other modules to be responsible for serializing and persisting nodes and they are then free to apply whatever storage optimisations they see fit, including batching writes.

i understand and agree but its called unixfs, its hard to have boundaries with the current situation.

can we decouple ipld from blockservice, and in unixfs serialize manually without ipld and use the blockservice directly to persist ? ill port this to the blockservice. What you think ?

daviddias · 2019-10-10T16:01:04Z

#38 (comment)

After a quick look it seems that js-ipld is only used for get() and put(). Those two operations could easily be done without js-ipld directly on the BlockService.

@vmx IPLD should be making things like Unixfs importers & exporters more easy to do, not say to not use it.

There are multiple operations here that Unixfs has to do manually (fetching node by node, checking which nodes it points to) that can be done by js-ipld, it's knowledge of the graph and its understanding of GraphSync.

vmx · 2019-10-10T16:01:15Z

can we decouple ipld from blockservice,

That won't be easy. That's planned for the next generation js-ipld, which won't be a thing soon. Is that actually needed? Why not just passing the BlockService into unixfs instead of js-ipld?

hugomrdias · 2019-10-10T16:02:15Z

can we decouple ipld from blockservice,

That won't be easy. That's planned for the next generation js-ipld, which won't be a thing soon. Is that actually needed? Why not just passing the BlockService into unixfs instead of js-ipld?

thats what i meant ^^

vmx · 2019-10-10T16:07:13Z

@vmx IPLD should be making things like Unixfs importers & exporters more easy to do, not say to not use it.

I would agree for > UnixFSv1. But as UnixFSv1 is now opimized for perf, it just doesn't make sense to use js-ipld, as DagPB isn't even a format that supports the full IPLD Data Model.

There are multiple operations here that Unixfs has to do manually (fetching node by node, checking which nodes it points to) that can be done by js-ipld, it's knowledge of the graph and its understanding of GraphSync.

Those things used to be considered part of IPLD, but that's not the case anymore. IPLD has a different (smaller?) scope these days. Those things would now be on a different layer (which then IPLD will probably use).

autonome · 2019-10-10T16:14:33Z

The benefit of a performance improvement of this magnitude will impact adoption in significant ways. Thanks for digging into it!

hugomrdias · 2019-10-10T16:17:26Z

@vmx and @achingbrain should i go forward with this #38 (comment) ?

achingbrain · 2019-10-10T16:52:58Z

should i go forward with this #38 (comment) ?

Do you mean:

ill port this to the blockservice

If so, yes please.

The benefit of a performance improvement of this magnitude will impact adoption

It's true, but we want the performance improvement to remain even after this module is retired for unixfsv2-importer or whatever, which is why it shouldn't live here.

The responsibilities are:

unixfs-importer: Turns a stream of buffers into nodes in a dag of some shape
ipld: Turns the dag nodes into blocks
block-service: stores blocks

We should not be optimising how we store blocks at a high level - we should pass stream of blocks to ipld, which should pass a stream of blocks to the block service, which should pull them into batches and write as appropriate, if that's the fastest way to write them to the available storage.

achingbrain · 2019-10-10T16:56:57Z

An implication of this change is that importing files is no longer atomic, because we start returning CIDs before they've been written to the block store.

The time window is small but applications will be able to receive CIDs from blocks that they've just added, request the block with the CID and find that it doesn't exist. If the application crashes or otherwise can't write the blocks (disk full, solar flare, I don't know), future users of the block store may not find the blocks they expect to be there.

daviddias · 2019-10-10T17:09:16Z

An implication of this change is that importing files is no longer atomic, because we start returning CIDs before they've been written to the block store.

🔦 Great thing to highlight! Before merging this PR, there must be a test with a faulty repo that fails some of the writes and than the test compares the CIDs that got returned with the CIDs that are in the repo. All of them should be there.

achingbrain · 2019-10-10T17:11:57Z

src/dag-builder/file/index.js

+  const timer = setIntervalAsync(async () => {
+    const temp = nodesToPersist
+    nodesToPersist = []
+    await ipld.putBatch(temp, options)


Where does the error go if this throws? Should we retry the batch? What if retrying fails?

mikeal · 2019-10-10T18:04:15Z

A few quick notes on js-unixfsv2 since it’s far enough along that the design is much clearer.

We put a lot of thought into how we can produce libraries that are useful both in IPFS as dependencies but also useful independently in other libraries or even in something like ipfs-lite.

There’s actually 2 libraries I’ll be producing.

`js-unixfsv2`

This library contains:

Schema generated low level API for encoding and validation of unixsfsv2 data.
A simple read interface for atomic read operations of file metadata and byte data.
A simple write interface for encoding/importing files and directories.

The read interfaces take a single function async get CID => Block which accepts a CID and returns a block. The write interfaces are just generators that produce new Block instances.

`js-unixfsv2-mfs`

This library does the low level work of a mutable file system. It will not implement the full IPFS MFS API, it’ll only provide the necessary functionality for that to be built on top.

The reason I’m writing this as well is because it’s non-trivial to provide transactional integrity of the filesystem without being quite close to the data. It’s also very useful functionality to be able to use independent of IPFS.

This library provides an interface for reads and writes to a mutable filesystem. This library will accept a bulk write function and a block get function and all read/write operations will be behind a mutex that can provide transaction integrity of the reads and writes and will batch all writes by default. This way, anyone writing to it in any way will get batched write behavior by default, even when the writes are being done concurrently. This means that, as far as these recent perf changes making their way into v2, you just need to hand me a bulk write function for whatever underlying data store you want and everything will be optimally batched and we can throw proper errors when concurrent actors try to update the same file.

Some other simple perf improvements I have planned for this: LRU block cache to speed up reads/writes in the same parts of the graph, and a bloom filter to speed up writes of the same data (much faster re-imports).

mikeal · 2019-10-10T18:13:14Z

An implication of this change is that importing files is no longer atomic, because we start returning CIDs before they've been written to the block store.

Is this also going to break back-pressure during large file imports? If we’re continuing to import data without flushing to disc then we’re going to eventually run out of memory once the disc is under enough load.

achingbrain · 2019-10-14T14:28:00Z

This is probably the wrong place to get into unixfsv2 implementation discussions, but..

There’s actually 2 libraries I’ll be producing.

It might minimise disruption to add unixfsv2 resolvers to the existing exporter at js-ipfs-unixfs-exporter/src/resolvers That way there'll be no extra work to required to traverse through a graph that contains unixfsv1 nodes, generic dag-cbor nodes and unixfsv2 nodes. I don't think unixfsv1 is going away any time soon.

js-unixfsv2-mfs

Interesting stuff. We already have something to prevent re-writing existent blocks in ipfs-repo and if @hugomrdias ports the changes here into the blockstore, we'll have batch writes at the block level which may be sufficient or better performance wise, though it's hard to say without measuring it.

The existing mfs implementation uses the same unixfs importer/exporter as ipfs.add, will your new module do the same thing? I ask because go-IPFS has a different import implementation for mfs from ipfs.add which has led to inconsistencies between the two - this is something I'm very keen to avoid.

daviddias · 2019-10-17T08:08:16Z

I would agree for > UnixFSv1. But as UnixFSv1 is now opimized for perf, it just doesn't make sense to use js-ipld, as DagPB isn't even a format that supports the full IPLD Data Model.

@vmx this is the kind of painpoints that any user of IPLD will hit and the reason why our Ethereum friends had to create https://github.com/ipld/js-ipld-graph-builder because the Perf of using js-ipld directly to add lots of dag nodes was terrible (which is a similar problem to what @hugomrdias is trying to fix, the only difference is that the graph-builder is designed for graphs that suffer a lot of changes, while unixfs is just dealing with big graphs).

graph-builder is what lead for the consideration of ipld/js-ipld#66

vmx · 2019-10-17T09:52:34Z

@daviddias I agree. The problem with the current js-ipld is that it didn't get the layers of abstractions quite right. That will be fixed with future iterations of the js ipld stack.

momack2 · 2019-11-05T21:47:36Z

what is the plan on this PR?

achingbrain · 2019-11-06T07:13:40Z

@hugomrdias was going to make these changes in the blockstore instead so the optimisation benefits other codecs and can live on with UnixFSv2.

achingbrain · 2019-11-06T07:17:32Z

Also I don't think we got to the bottom of the conversation around whether we are ok with no longer having atomic writes. There's code in the filestore ensuring all writes are atomic, it'd all be redundant if this is how we want to do things going forward.

Adds two new options: `fileImportConcurrency` This controls the number of files that are imported concurrently. You may wish to set this high if you are importing lots of small files. `blockWriteConcurrency` This controls how many blocks from each file we write to disk at the same time. Setting this high when writing large files will significantly increase import speed, though having it high when `fileImportConcurrency` is also high can swamp the process. It also: 1. Flattens module options because validating deep objects was clunky and the separation of access to config sub objects within this module isn't very good 1. Replaces `superstruct` and `deep-extend` with `merge-options` which is better suited for merging options and is smaller` 1. Replaces `async-iterator-*` modules with the more zeitgeisty `it-*` namespace Supersedes #38, sort of.

Adds two new options: `fileImportConcurrency` This controls the number of files that are imported concurrently. You may wish to set this high if you are importing lots of small files. `blockWriteConcurrency` This controls how many blocks from each file we write to disk at the same time. Setting this high when writing large files will significantly increase import speed, though having it high when `fileImportConcurrency` is also high can swamp the process. It also: 1. Flattens module options because validating deep objects was clunky and the separation of access to config sub objects within this module isn't very good 1. Replaces `superstruct` and `deep-extend` with `merge-options` which is better suited for merging options and is smaller 1. Replaces `async-iterator-*` modules with the more zeitgeisty `it-*` namespace Supersedes #38, sort of.

* perf: concurrent file import Adds two new options: `fileImportConcurrency` This controls the number of files that are imported concurrently. You may wish to set this high if you are importing lots of small files. `blockWriteConcurrency` This controls how many blocks from each file we write to disk at the same time. Setting this high when writing large files will significantly increase import speed, though having it high when `fileImportConcurrency` is also high can swamp the process. It also: 1. Flattens module options because validating deep objects was clunky and the separation of access to config sub objects within this module isn't very good 1. Replaces `superstruct` and `deep-extend` with `merge-options` which is better suited for merging options and is smaller 1. Replaces `async-iterator-*` modules with the more zeitgeisty `it-*` namespace Supersedes #38, sort of. No batching but atomicity guarantees are maintained and performance gains are broadly similar with the right tuning.

achingbrain · 2020-02-21T10:54:24Z

I'm going to close this because importing speed has massively increased with #41 and will increase further with ipfs/js-ipfs#2771 neither of which compromise atomicity of writes.

hugomrdias added 3 commits October 8, 2019 16:59

feat: add support for batch puts and benchmarks

b7f2970

chore: improve benchmarks

08cdfd3

fix: fix the timer for batch

a0b5bd5

hugomrdias changed the title ~~This PR adds support for batch puts of block into the datastore.~~ add support for batch puts Oct 10, 2019

hugomrdias requested review from achingbrain, vmx, daviddias and lidel October 10, 2019 13:32

hugomrdias mentioned this pull request Oct 10, 2019

feat: add support for batch puts ipld/js-ipld#238

Closed

achingbrain reviewed Oct 10, 2019

View reviewed changes

achingbrain mentioned this pull request Oct 14, 2019

Should we dial the modularization a bit down? Do we all see the value on having everything as a module? ipfs/js-ipfs#2222

Closed

4 tasks

achingbrain mentioned this pull request Nov 27, 2019

perf: concurrent file import #41

Merged

achingbrain closed this Feb 21, 2020

achingbrain deleted the feat/batch-put branch February 21, 2020 10:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add support for batch puts #38

add support for batch puts #38

hugomrdias commented Oct 10, 2019 •

edited

Loading

daviddias commented Oct 10, 2019

achingbrain commented Oct 10, 2019

hugomrdias commented Oct 10, 2019

hugomrdias commented Oct 10, 2019

vmx commented Oct 10, 2019

achingbrain commented Oct 10, 2019

hugomrdias commented Oct 10, 2019 •

edited

Loading

daviddias commented Oct 10, 2019

vmx commented Oct 10, 2019

hugomrdias commented Oct 10, 2019

vmx commented Oct 10, 2019

autonome commented Oct 10, 2019

hugomrdias commented Oct 10, 2019

achingbrain commented Oct 10, 2019

achingbrain commented Oct 10, 2019

daviddias commented Oct 10, 2019

achingbrain Oct 10, 2019

mikeal commented Oct 10, 2019

mikeal commented Oct 10, 2019

achingbrain commented Oct 14, 2019

daviddias commented Oct 17, 2019

vmx commented Oct 17, 2019

momack2 commented Nov 5, 2019

achingbrain commented Nov 6, 2019

achingbrain commented Nov 6, 2019

achingbrain commented Feb 21, 2020

add support for batch puts #38

add support for batch puts #38

Conversation

hugomrdias commented Oct 10, 2019 • edited Loading

browser

node

daviddias commented Oct 10, 2019

achingbrain commented Oct 10, 2019

hugomrdias commented Oct 10, 2019

hugomrdias commented Oct 10, 2019

vmx commented Oct 10, 2019

achingbrain commented Oct 10, 2019

hugomrdias commented Oct 10, 2019 • edited Loading

daviddias commented Oct 10, 2019

vmx commented Oct 10, 2019

hugomrdias commented Oct 10, 2019

vmx commented Oct 10, 2019

autonome commented Oct 10, 2019

hugomrdias commented Oct 10, 2019

achingbrain commented Oct 10, 2019

achingbrain commented Oct 10, 2019

daviddias commented Oct 10, 2019

achingbrain Oct 10, 2019

Choose a reason for hiding this comment

mikeal commented Oct 10, 2019

js-unixfsv2

js-unixfsv2-mfs

mikeal commented Oct 10, 2019

achingbrain commented Oct 14, 2019

daviddias commented Oct 17, 2019

vmx commented Oct 17, 2019

momack2 commented Nov 5, 2019

achingbrain commented Nov 6, 2019

achingbrain commented Nov 6, 2019

achingbrain commented Feb 21, 2020

hugomrdias commented Oct 10, 2019 •

edited

Loading

hugomrdias commented Oct 10, 2019 •

edited

Loading

`js-unixfsv2`

`js-unixfsv2-mfs`