Draft: dealing with data too large for a single buffer #6138
base: trunk
Conversation
…this doesn't support anyway)
Sorry for the long wait time for a review!
Frankly, as it exists right now, we cannot accept this example. While it does show one strategy for dealing with large data sets, after reading it the user doesn't get a good idea of why that strategy should be used and what problems it avoids, compared to the more naive strategy of using larger and larger buffers. Through inline code comments and verbiage in the readme, a reader who has no idea about any of these topics (or even the details of memory allocation) should be able to understand why this is an effective strategy.
Some things I think it should touch on:
- Large buffers may fail to allocate due to fragmentation
- Growing/shrinking a dataset with a single-buffer system requires copying the entire buffer contents, whereas paginated data just requires rebuilding a bind group (see the sketch after this comment).
I'm not going to close this, as I do think this can be transformed into something that would be great to have.
Added a few incidental comments.
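To make the second point concrete, here is a minimal, hypothetical sketch of growing a paginated dataset by one page; it is not the example's actual code, and it assumes a bind group layout that already declares one storage-buffer binding per page. The point is that only the new page is allocated and no existing data is copied:

```rust
fn grow_by_one_page(
    device: &wgpu::Device,
    pages: &mut Vec<wgpu::Buffer>,
    layout: &wgpu::BindGroupLayout,
    page_size: u64,
) -> wgpu::BindGroup {
    // Only the new page is allocated; existing pages and their contents
    // are untouched, so nothing is copied.
    pages.push(device.create_buffer(&wgpu::BufferDescriptor {
        label: Some("page"),
        size: page_size,
        usage: wgpu::BufferUsages::STORAGE | wgpu::BufferUsages::COPY_DST,
        mapped_at_creation: false,
    }));
    // Rebuilding the bind group over the new set of pages is cheap
    // compared to copying a giant buffer into an even bigger one.
    let entries: Vec<wgpu::BindGroupEntry> = pages
        .iter()
        .enumerate()
        .map(|(i, buffer)| wgpu::BindGroupEntry {
            binding: i as u32,
            resource: buffer.as_entire_binding(),
        })
        .collect();
    device.create_bind_group(&wgpu::BindGroupDescriptor {
        label: Some("paginated data"),
        layout,
        entries: &entries,
    })
}
```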
@@ -0,0 +1 @@
Empty file? This example definitely needs tests
I could use a bit of advice on the test(s) for this: the included shader relies on traversing the contiguous array (made from the multiple buffers) via an OFFSET; currently it needs to know this information ahead of compile time, hence the consts below.
const OFFSET: u32 = 1u << 8u;                      // stride used when traversing the combined array
const BUFF_LENGTH: u32 = 1u << 25u;                // elements per buffer
const NUM_BUFFERS: u32 = 2u;                       // number of buffers the data is split across
const TOTAL_SIZE: u32 = BUFF_LENGTH * NUM_BUFFERS; // total elements across all buffers
This is of course assuming these values make sense for the test environment; I've chosen them based on the test output from previous runs. I don't know of a better way in WGSL to assign suitable values to these dynamically at runtime, as opposed to what I've done here. If any maintainers know a cleaner or more idiomatic way to tweak the shader to avoid doing this, that'd be awesome; one possibility is sketched below.
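One avenue that might fit here (an assumption on my part, not something the example currently does) is WGSL pipeline-overridable constants: declare the values as override instead of const and supply them from the Rust side at pipeline creation. This only works where the values appear in expressions rather than sizing fixed-length arrays, and the exact shape of PipelineCompilationOptions depends on the wgpu version:

```rust
use std::collections::HashMap;

// WGSL side: `override` instead of `const`, with defaults as a fallback.
const SHADER_SRC: &str = r#"
override BUFF_LENGTH: u32 = 33554432u; // 1 << 25
override NUM_BUFFERS: u32 = 2u;
"#;

// Rust side: pick values at runtime (e.g. from adapter limits) and pass
// them in when creating the pipeline. `module` is the compiled shader.
fn create_pipeline(device: &wgpu::Device, module: &wgpu::ShaderModule) -> wgpu::ComputePipeline {
    let constants: HashMap<String, f64> = [
        ("BUFF_LENGTH".to_string(), (1u64 << 25) as f64),
        ("NUM_BUFFERS".to_string(), 2.0),
    ]
    .into_iter()
    .collect();
    device.create_compute_pipeline(&wgpu::ComputePipelineDescriptor {
        label: Some("large-data example"),
        layout: None,
        module,
        entry_point: "main",
        compilation_options: wgpu::PipelineCompilationOptions {
            constants: &constants,
            ..Default::default()
        },
        cache: None,
    })
}
```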
On the fragmentation and pagination notes:
- For the fragmentation: is a warning similar to what's mentioned in the doc-string on wgpu::Limits enough? We could link to some external docs, perhaps https://developer.nvidia.com/docs/drive/drive-os/archives/6.0.4/linux/sdk/common/topics/graphics_content/avoiding_memory_fragmentation.html
- Perhaps a pagination example is also required that this example could link to? [I would be interested in working on that]
Cheers, I'll keep working on it.
wip: get a test going
Added a test.
Asked some questions.
Attempted to simplify.
Connections
Discussion thread on Matrix
Description
The aim of this new example is to demonstrate taking a large input dataset, splitting it into chunks so that it can be moved onto the GPU, and then treating it as a single contiguous data structure once it's there. A rough sketch of the upload side follows.
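This hypothetical helper is not the example's actual code; it assumes bytemuck for the byte cast and one pre-created buffer per page:

```rust
// Split a large dataset into page-sized chunks and upload each chunk
// into its own buffer; the shader then indexes across the pages as if
// they were one array (buffer = i / page_len, element = i % page_len).
fn upload_paged(queue: &wgpu::Queue, pages: &[wgpu::Buffer], data: &[u32], page_len: usize) {
    for (chunk, buffer) in data.chunks(page_len).zip(pages) {
        queue.write_buffer(buffer, 0, bytemuck::cast_slice(chunk));
    }
}
```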
Testing
There's a single test that runs the same call made in the example itself: it allocates what should be two buffers' worth of 0s (on the CI GPU), then has the GPU add 1 to them. The assertions amount to the sketch after this list.
- The lengths of the input and output are asserted to be equal.
- The contents of the returned array are asserted to all be 1.
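Hypothetical shape of those assertions (execute is a stand-in for the example's entry point, not its real name):

```rust
// `execute` runs the example's compute pass over the input and returns
// the readback; TOTAL_SIZE mirrors the shader const above.
fn check(execute: impl Fn(&[u32]) -> Vec<u32>) {
    const TOTAL_SIZE: usize = (1 << 25) * 2;
    let input = vec![0u32; TOTAL_SIZE];
    let output = execute(&input);
    assert_eq!(output.len(), input.len()); // lengths match
    assert!(output.iter().all(|&v| v == 1)); // every element was incremented
}
```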
Checklist
- Run cargo fmt.
- Run cargo clippy. If applicable, add:
  - --target wasm32-unknown-unknown
  - --target wasm32-unknown-emscripten
- Run cargo xtask test to run tests.
- Add change to CHANGELOG.md. See simple instructions inside file.