Add cloudflare worker to collect API key usage inside Influx DB #1

pyropy · 2024-12-10T16:13:13Z

Reports API usage by API key to InfluxDB.
Closes space-meridian/roadmap#206 (Collect & visualise spark-stats API usage based on access tokens)

bajtos · 2024-12-10T16:22:33Z

@pyropy in the future, please link to the relevant milestone issues in your pull request descriptions, to make it easier for others to navigate.

In this case:

Collect & visualise spark-stats API usage based on access tokens space-meridian/roadmap#206

index.js

bajtos

It would be great to have at least some basic tests to validate this worker works as intended. What are the best practices for writing automated tests for Cloudflare Workers?

pyropy · 2024-12-10T17:22:34Z

It would be great to have at least some basic tests to validate this worker works as intended. What are the best practices for writing automated tests for Cloudflare Workers?

I have added unit tests. Cloudflare has documentation on testing Workers, but I wonder whether we need integration tests?
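One lightweight way to get the basic tests requested above is to keep the handler free of globals and pass in its collaborators, so it can run under plain Node.js without a Workers runtime. The sketch below only illustrates that idea. `createWorker`, `fetchOrigin`, `reportMetric`, and `createTestContext` are hypothetical names, not the functions in this PR; `ctx.waitUntil()` is the only part taken from the Workers handler contract.

```javascript
// Hypothetical handler factory: collaborators are injected so tests can stub them.
function createWorker({ fetchOrigin, reportMetric }) {
  return {
    async fetch(request, env, ctx) {
      const response = await fetchOrigin(request);
      // Report in the background so the client is not kept waiting.
      ctx.waitUntil(reportMetric(request, env));
      return response;
    },
  };
}

// Test double for the execution context: records promises passed to waitUntil().
function createTestContext() {
  const pending = [];
  return {
    waitUntil: (promise) => { pending.push(promise); },
    settled: () => Promise.all(pending),
  };
}

// Example test body that uses stubs instead of the network.
async function runExample() {
  const reported = [];
  const worker = createWorker({
    fetchOrigin: async () => new Response('ok', { status: 200 }),
    reportMetric: async (request) => { reported.push(request.url); },
  });
  const ctx = createTestContext();
  const response = await worker.fetch(new Request('https://example.com/stats'), {}, ctx);
  await ctx.settled();
  return { status: response.status, reported };
}
```

A test like this checks the wiring only. Whether the worker, the Cloudflare cache, and the origin cooperate correctly would still need an integration test against a real deployment.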

juliangruber · 2024-12-11T10:41:19Z

Closes space-meridian/roadmap#206

This doesn't close it fully, as the visualization isn't done yet, right?

test/metrics.test.js

README.md

lib/request.js

lib/metrics.js

lib/request.js

.env.example

lib/request.js

Co-authored-by: Julian Gruber <[email protected]>

lib/influx.js

bajtos

You are making great progress!

test/influx.test.js

lib/influx.js

test/influx.test.js

test/worker.test.js

bin/worker.js

test/worker.test.js

bajtos · 2024-12-12T13:04:52Z

README.md

+Other required environment variables include the following:
+- `INFLUX_URL` - InfluxDB URL
+- `INFLUX_DATABASE` - InfluxDB database (bucket) name
+- `INFLUX_METRIC_NAME` - InfluxDB metric name


from(bucket: "api-observability")

I find this confusing. We have spark-api and spark-stats projects, this bucket name leads me to think about spark-api, while we are reporting telemetry for spark-stats.

Proposed bucket name: spark-stats-telemetry or simply spark-stats.

+1 to spark-stats; I think spark-stats-telemetry is redundant in the context of InfluxDB.

Let's go with spark-stats once we have this finished. I'll keep this bucket name while in development.
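For readers unfamiliar with how the environment variables above fit together, here is a minimal sketch of turning one request into an InfluxDB line-protocol point and describing the HTTP write. It assumes the InfluxDB v2 `/api/v2/write` endpoint and a secret named `INFLUX_TOKEN`. The function names and the `api_key` field name are hypothetical, and the actual code in `lib/influx.js` may differ. The key is stored as a field value, in line with the "Report api key as value" commit title.

```javascript
// Escape a string for use as a line-protocol measurement name.
function escapeMeasurement(name) {
  return name.replace(/[, ]/g, (ch) => '\\' + ch);
}

// Escape a string field value and wrap it in double quotes.
function quoteFieldValue(value) {
  return '"' + value.replace(/[\\"]/g, (ch) => '\\' + ch) + '"';
}

// Build one point: <measurement> api_key="<key>" <timestamp in ms>
function buildLine(metricName, apiKey, timestampMs) {
  return `${escapeMeasurement(metricName)} api_key=${quoteFieldValue(apiKey)} ${timestampMs}`;
}

// Describe the write request without sending it, so it is easy to test.
function buildWriteRequest(env, line) {
  const url = new URL('/api/v2/write', env.INFLUX_URL);
  url.searchParams.set('bucket', env.INFLUX_DATABASE);
  url.searchParams.set('precision', 'ms');
  return {
    url: url.toString(),
    init: {
      method: 'POST',
      headers: {
        Authorization: `Token ${env.INFLUX_TOKEN}`, // assumed secret name
        'Content-Type': 'text/plain; charset=utf-8',
      },
      body: line,
    },
  };
}
```

In a worker this could be used as `const { url, init } = buildWriteRequest(env, buildLine(env.INFLUX_METRIC_NAME, apiKey, Date.now()))` followed by `fetch(url, init)`. With this layout, renaming the bucket to spark-stats is a configuration change only.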

bajtos

The code looks very good now. Besides addressing the existing comments (and a few new ones below), the most important step now is to verify that the proposed architecture (worker - cache - origin integration) works as expected.

test/influx.test.js

bajtos · 2024-12-13T07:54:21Z

wrangler.toml.example

@@ -1,11 +1,10 @@
 name = "cf-metrics"


I propose finding a more descriptive identifier for this worker. I find "cf-metrics" too vague/generic.

ATM, this worker is observing access tokens and reporting the usage data. In the future, we will also want to allow access to authenticated clients only. Also, this worker is specific to spark-stats service.

Here are some alternatives I find more fitting:

spark-stats-auth

spark-stats-interceptor

spark-stats-handler

@juliangruber do you have any thoughts on this?

I also find cf-metrics too generic. Once we have found a name, let's update the repository name to match.

In the future, we will also want to allow access to authenticated clients only

I'm not sure if that isn't better handled in spark-stats, so I'd be cautious to count this as a requirement. To me, its sole purpose is to report api token usage to InfluxDB. What about

spark-stats-request-metrics

spark-stats-user-metrics

@juliangruber spark-stats-request-metrics sounds good to me. Miro, do you agree?

ping @bajtos

In the future, we will also want to allow access to authenticated clients only

I'm not sure if that isn't better handled in spark-stats, so I'd be cautious to count this as a requirement. To me, its sole purpose is to report api token usage to InfluxDB. What about

In my understanding, we must do authentication at the edge (in a CF worker) because we are heavily caching the responses, and therefore, most requests never reach spark-stats.

Having written that, I am fine with leaving authentication out of the scope of the current discussion.

The name spark-stats-request-metrics sounds good to me.
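To illustrate the caching point made in this thread: a response served from Cloudflare's cache never reaches spark-stats, so any per-token accounting has to read the token in the worker itself. The sketch below assumes the token arrives as a Bearer `Authorization` header or, as a fallback, an `api-key` query parameter. Both conventions and the name `getApiKey` are assumptions for illustration, not confirmed by this PR.

```javascript
// Hypothetical helper: extract the caller's API key from a request.
// Falls back to a placeholder so anonymous traffic is still counted.
function getApiKey(request, fallback = 'unknown') {
  const header = request.headers.get('authorization') ?? '';
  const match = /^Bearer\s+(\S+)\s*$/i.exec(header);
  if (match) return match[1];
  const fromQuery = new URL(request.url).searchParams.get('api-key');
  return fromQuery || fallback;
}
```

Rejecting unauthenticated clients, if it is ever added, could reuse the same lookup, but that stays outside the scope agreed above.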

pyropy · 2024-12-13T11:07:53Z

The code looks very good now. Besides addressing the existing comments (and a few new ones below), the most important step now is to verify that the proposed architecture (worker - cache - origin integration) works as expected.

So I've re-read the docs, and here's my takeaway:

Conceptually, there are two ways to interact with Cloudflare’s Cache using a Worker:

Call to fetch() in a Workers script. Requests proxied through Cloudflare are cached even without Workers according to a zone’s default or configured behavior (for example, static assets like files ending in .jpg are cached by default). Workers can further customize this behavior by: Setting Cloudflare cache rules (that is, operating on the cf object of a request).

My understanding of the above: since our resource (the app server running on fly.dev) will still be proxied by Cloudflare, Cloudflare will cache the responses automatically.

Store responses using the Cache API from a Workers script. This allows caching responses that did not come from an origin and also provides finer control by:

Customizing cache behavior of any asset by setting headers such as Cache-Control on the response passed to cache.put().

Caching responses generated by the Worker itself through cache.put().

My understanding of the above: if a request is not proxied by Cloudflare, you have to put the response into the cache manually, but fetch itself will be able to read it from the cache if it was cached previously.
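The two approaches quoted above can be contrasted in a short sketch. `proxyWithEdgeCache` relies on `fetch()` with `cf` options, and `serveWithCacheApi` calls `cache.match()` and `cache.put()` explicitly. The function names and the 60-second TTL are hypothetical. The `cache` argument would be `caches.default` in a Worker and is passed in here so the logic can be exercised with a fake. Only GET requests are cached, matching the "Do not cache POST requests" commits; this is not necessarily how `bin/worker.js` is written.

```javascript
// Approach 1: let Cloudflare cache the proxied response; `cf` tunes the behaviour.
// The `cf` option has no effect outside the Workers runtime.
function proxyWithEdgeCache(request, fetchImpl = fetch) {
  const cacheable = request.method === 'GET';
  const init = cacheable ? { cf: { cacheEverything: true, cacheTtl: 60 } } : {};
  return fetchImpl(request, init);
}

// Approach 2: manage the cache explicitly through the Cache API.
async function serveWithCacheApi(request, ctx, cache, fetchImpl = fetch) {
  // Never cache POST and other non-GET requests.
  if (request.method !== 'GET') return fetchImpl(request);

  const cached = await cache.match(request);
  if (cached) return cached;

  const response = await fetchImpl(request);
  // Store a copy in the background; the original body goes to the client.
  if (response.ok) ctx.waitUntil(cache.put(request, response.clone()));
  return response;
}
```

With the first approach the worker still runs on every request, which is what makes per-request metrics possible even when the response itself comes from the cache.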

bin/worker.js

lib/influx.js

juliangruber · 2024-12-16T12:03:45Z

wrangler.toml.example

@@ -1,11 +1,10 @@
 name = "cf-metrics"


ping @bajtos

Co-authored-by: Julian Gruber <[email protected]>

bin/worker.js

.github/workflows/ci.yml

Cache response inside cloudflare worker

07f4429

pyropy self-assigned this Dec 10, 2024

pyropy requested review from juliangruber and bajtos December 10, 2024 16:13

pyropy mentioned this pull request Dec 10, 2024

Collect & visualise spark-stats API usage based on access tokens space-meridian/roadmap#206

Closed

7 tasks

Do not cache POST requests

6c5318d

bajtos reviewed Dec 10, 2024

index.js (outdated, resolved)

Do not cache POST request responses

e2ca738

bajtos reviewed Dec 10, 2024

pyropy added 6 commits December 10, 2024 17:27

Simplify request formating code

51b147b

Remove unused function arguments

4c547d7

Use default project layout

467e081

Update example env

0bba446

Move request logic to worker

15abebe

Add unit tests for worker.fetch

5891c88

Add basic docs

df26c99

pyropy changed the title from "Cache response inside cloudflare worker" to "Add cloudflare worker to collect API key usage inside Influx DB" Dec 10, 2024

juliangruber requested changes Dec 11, 2024

pyropy and others added 9 commits December 11, 2024 13:06

Update lib/request.js

1ab824c

Co-authored-by: Julian Gruber <[email protected]>

Update lib/metrics.js

3525f3e

Co-authored-by: Julian Gruber <[email protected]>

Update lib/request.js

f6c88ac

Co-authored-by: Julian Gruber <[email protected]>

Update lib/request.js

8732dc3

Co-authored-by: Julian Gruber <[email protected]>

Update .env.example

882792d

Co-authored-by: Julian Gruber <[email protected]>

Move all influx db related code to lib/influx.js

d27848d

Add basic github actions

116b146

Fix failing test

bb1ae1c

Fix missing wranger.toml in test job

a8d3bb9

pyropy added 2 commits December 12, 2024 11:33

Remove unused env variable

4bd5fb2

Report api key as value

700b15a

bajtos reviewed Dec 12, 2024

lib/influx.js (outdated, resolved)

bajtos reviewed Dec 12, 2024

lib/influx.js (outdated, resolved)

pyropy added 4 commits December 12, 2024 13:37

Edit default env variables and secrets

201b375

Reformat files with newline at EOF

7f844c6

Refactor influx.js module

16cae7e

Reformat JSDoc

ef2ecd0

bajtos requested changes Dec 12, 2024

bajtos reviewed Dec 12, 2024

pyropy added 4 commits December 12, 2024 14:11

Update test name

842eaf5

Add table tests for influx lib

81e8b56

Refactor how influx lib is tested

8fc77ed

Add test for reportRequestMetric

c43b0d5

bajtos reviewed Dec 13, 2024

Improve influx test

081d4ea

Use envsubst to replace env vars in gh actions

bb531a9

pyropy requested review from bajtos and juliangruber December 16, 2024 11:57

juliangruber requested changes Dec 16, 2024

pyropy and others added 3 commits December 16, 2024 13:05

Update lib/influx.js

357da93

Co-authored-by: Julian Gruber <[email protected]>

Update bin/worker.js

82feb43

Co-authored-by: Julian Gruber <[email protected]>

Fix imports and mocks for renamed functions in tests

bdc81ee

juliangruber approved these changes Dec 16, 2024

bajtos approved these changes Dec 17, 2024

bin/worker.js (outdated, resolved)

.github/workflows/ci.yml (resolved)

pyropy added 2 commits December 17, 2024 11:06

Rename project to spark-stats-request-metrics

99c0884

Rename import for reporting metrics to influx

8f5ef87

pyropy merged commit 3d1f119 into main Dec 17, 2024
1 check passed

pyropy deleted the cache-response-inside-cloudflare-worker branch December 17, 2024 10:08
