Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ci: merge cached Nix stores #3071

Merged
merged 1 commit into from
Dec 5, 2023

Conversation

develop7
Copy link
Collaborator

@develop7 develop7 commented Nov 28, 2023

Introduces a nix cache warm-up job, which prefetches all the derivations we ever need to the Nix store, which is cached right away. Such cache is reused then in further actions, reducing in-job downloads to zero.

Before this PR every nix-involving job (13 of them on Linux) took 3 to 4 minutes to do "nix setup" due to downloading 121 to 1409 paths, 300 to 1360 MB big. While these jobs cache Nix store here and there, there's a handful of them, multiplied by 2.3-ish GB in size, and thus they exceed the upper cache limit for GH Actions of 10 GB.
After merging this PR, a cold run takes ~6 minutes to warm up the cache, then every nix-involving job takes 1:30 mins to unpack the cache and have all the derivations installed. A warm run takes 1 minute of a no-op cache warm up step and the same 1:30 to unpack the cache. This produces one Nix store cache per OS of 2.3 GB in size, which fit the storage nicely and would probably be a benefit for every build, for main and PR branches as well.

TODO: make the loadtest workflow use a common cache as well, maybe by extracting the "warm up" job to a separate workflow.

@develop7 develop7 force-pushed the feat-ci_merge_nix_caches branch 2 times, most recently from 2f21971 to dd5da89 Compare November 28, 2023 15:56
@develop7
Copy link
Collaborator Author

@laurenceisla @steve-chavez could you guys adopt this PR so we could test if purging individual caches actually works (and cache space is saved) because it looks like I'm being hit by the lack of permissions again (see here)?

@develop7
Copy link
Collaborator Author

develop7 commented Nov 29, 2023 via email

@develop7
Copy link
Collaborator Author

develop7 commented Dec 1, 2023

Okay, I've extracted the job to a separate workflow, now it should be merged to main to be triggered (see the note at https://docs.github.com/en/actions/using-workflows/events-that-trigger-workflows#workflow_run)

@laurenceisla @steve-chavez

@develop7
Copy link
Collaborator Author

develop7 commented Dec 3, 2023

Another approach I can think of is to prepopulate nix store with all the related derivations in a separate job or even workflow, cache it and reuse this cache in subsequent jobs.

Let me try this.

@develop7
Copy link
Collaborator Author

develop7 commented Dec 3, 2023

So prepopulating job takes 6 minutes, produces one cache 2.3G big, all the "nix setup" steps in the nix-involving jobs on Linux take 1:30 mins tops.

@develop7 develop7 changed the title ci: enable merge-nix-caches-linux job ci: merge cached Nix stores Dec 3, 2023
@develop7 develop7 force-pushed the feat-ci_merge_nix_caches branch from ac6abb7 to 6e114da Compare December 4, 2023 13:17
@develop7 develop7 marked this pull request as ready for review December 4, 2023 13:45
@develop7 develop7 force-pushed the feat-ci_merge_nix_caches branch from 6e114da to 759e4ce Compare December 4, 2023 15:24
@develop7 develop7 force-pushed the feat-ci_merge_nix_caches branch from 391426d to cfa8d0d Compare December 5, 2023 14:46
@develop7 develop7 requested a review from laurenceisla December 5, 2023 15:40
@develop7 develop7 force-pushed the feat-ci_merge_nix_caches branch from 0808511 to 9a9dbbb Compare December 5, 2023 15:52
@develop7 develop7 force-pushed the feat-ci_merge_nix_caches branch from 9a9dbbb to dd8a518 Compare December 5, 2023 15:58
Copy link
Member

@laurenceisla laurenceisla left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks great! Took 8 mins to complete all the tests, and that's mostly due to the macOS build. GJ!

@laurenceisla laurenceisla merged commit 7208336 into PostgREST:main Dec 5, 2023
30 checks passed
@laurenceisla
Copy link
Member

laurenceisla commented Dec 5, 2023

It's faster now! But I noticed these logs are starting to show (they're also present in this PR's Actions), e.g.:

/usr/bin/tar: ../../../../../nix/store/jad0xjhpmbb7hhsi3r0sp9llbhcy1qqb-aws-c-mqtt-0.8.14/include/aws/mqtt/v5/mqtt5_listener.h: Cannot open: File exists
/usr/bin/tar: ../../../../../nix/store/jad0xjhpmbb7hhsi3r0sp9llbhcy1qqb-aws-c-mqtt-0.8.14/include/aws/mqtt/v5/mqtt5_listener.h: Cannot open: File exists
/usr/bin/tar: ../../../../../nix/store/jad0xjhpmbb7hhsi3r0sp9llbhcy1qqb-aws-c-mqtt-0.8.14/include/aws/mqtt/v5/mqtt5_client.h: Cannot open: File exists
/usr/bin/tar: ../../../../../nix/store/jad0xjhpmbb7hhsi3r0sp9llbhcy1qqb-aws-c-mqtt-0.8.14/include/aws/mqtt/v5/mqtt5_types.h: Cannot open: File exists
...

https://github.com/PostgREST/postgrest/actions/runs/7105291857/job/19342398874#step:3:68

That didn't happen before:

https://github.com/PostgREST/postgrest/actions/runs/7092294352/job/19303274920#step:3:68

I believe the Jobs could be even faster if that didn't happen. Not sure where the problem is though, the only difference I see for that step is that restore-key-hit is false now.

@develop7
Copy link
Collaborator Author

develop7 commented Dec 5, 2023

But I noticed these logs are starting to show (they're also present in this PR's Actions), e.g.:

Noticed those too it appears to be fine: https://github.com/nix-community/cache-nix-action/#limitations

@laurenceisla
Copy link
Member

Ah, it's a known issue then. OK, let's leave it at that.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

Successfully merging this pull request may close these issues.

2 participants