Intercept fetches to R2 and use direct CARPARK pull #141

hannahhoward · 2025-01-04T20:52:16Z

Goals

When we fetch blocks from an R2 CARPark URL, we appear to get TTFB around 400ms. Interestingly, if we instead use Cloudflare Worker's direct R2 access, our TTFB drops to around 200ms. Bandwidth doesn't seem to change much, but this is still an important win.

Implementation

Using a seperate change in blob-fetcher to allow passing a custom fetch implementation, we now write a version of fetch that checks URLs to see if they are CAR Park URLs. if they are, instead of using the fetch API, we use our direct connection to CARPark in the worker to perform the fetch request.
Pass this custom fetch to the batching fetch used by content claims dagula
Also, to assemble a browser like Response object use the existing code in withCarBlockHandler, factored out to utility file
In the process of doing this, I discovered a bug in that code where it was treating the last byte in the range as exclusive to the returned content rather inclusive, and this was undetected as the test was expecting the wrong result.

For Discussion

Comparison traces, note the average TTFB of 200 vs 400 on requests to R2

Regular fetch:

With intercepted direct gets to R2:

hannahhoward · 2025-01-04T20:53:26Z

blocked by storacha/blob-fetcher#30

fforbeck

LGTM!

alanshaw

Code change looks good to me but does require non-multipart byte range requests from the fetcher.

Do you see the 200ms perf issue when hitting the public URL directly...or just from the worker? I'm just wondering if it's the public bucket code that needs optimizing? https://github.com/storacha/public-bucket - looks like it does a bucket.head then bucket.get for all requests - perhaps that's the difference?

alanshaw · 2025-01-06T15:28:14Z

src/middleware/withCarParkFetch.js

+        if (rangeHeader) {
+          try {
+            range = parseRange(rangeHeader)
+          } catch (err) {


Suggested change

} catch (err) {

} catch (err) {

console.warn('parsing range header', err)

hannahhoward · 2025-01-08T02:24:54Z

@alanshaw now uses the public public handler directly. perf looks really good:

hannahhoward changed the base branch from main to fix/tracing-usage January 4, 2025 20:52

hannahhoward mentioned this pull request Jan 4, 2025

feat(fetcher): allow passing a custom fetch implementation storacha/blob-fetcher#31

Merged

hannahhoward requested review from alanshaw and fforbeck January 4, 2025 20:53

Base automatically changed from fix/tracing-usage to main January 5, 2025 00:02

hannahhoward force-pushed the feat/use-carpark-fetch branch from 24f687a to d79f917 Compare January 5, 2025 00:03

fforbeck approved these changes Jan 5, 2025

View reviewed changes

alanshaw approved these changes Jan 6, 2025

View reviewed changes

hannahhoward added 3 commits January 7, 2025 14:55

feat(middleware): add R2 interceptor

5182fa3

refactor(middleware): fix incompatibilities across Response

f822f4e

refactor(middleware): use public bucket handler directly

b2cf317

hannahhoward force-pushed the feat/use-carpark-fetch branch 2 times, most recently from 51e9393 to e0ba7c8 Compare January 8, 2025 02:15

fix(deps): use pre-release handler for now

c1944ff

hannahhoward force-pushed the feat/use-carpark-fetch branch from e0ba7c8 to c1944ff Compare January 8, 2025 02:22

chore(deps): used released versions of packages

45e6c1f

hannahhoward merged commit 4daf8a3 into main Jan 9, 2025
1 check passed

hannahhoward deleted the feat/use-carpark-fetch branch January 9, 2025 02:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Intercept fetches to R2 and use direct CARPARK pull #141

Intercept fetches to R2 and use direct CARPARK pull #141

hannahhoward commented Jan 4, 2025 •

edited

Loading

hannahhoward commented Jan 4, 2025 •

edited

Loading

fforbeck left a comment

alanshaw left a comment

alanshaw Jan 6, 2025

hannahhoward commented Jan 8, 2025

	} catch (err) {
	} catch (err) {
	console.warn('parsing range header', err)

Intercept fetches to R2 and use direct CARPARK pull #141

Intercept fetches to R2 and use direct CARPARK pull #141

Conversation

hannahhoward commented Jan 4, 2025 • edited Loading

Goals

Implementation

For Discussion

hannahhoward commented Jan 4, 2025 • edited Loading

fforbeck left a comment

Choose a reason for hiding this comment

alanshaw left a comment

Choose a reason for hiding this comment

alanshaw Jan 6, 2025

Choose a reason for hiding this comment

hannahhoward commented Jan 8, 2025

hannahhoward commented Jan 4, 2025 •

edited

Loading

hannahhoward commented Jan 4, 2025 •

edited

Loading