Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Provide prefix sizes #81

Open
ErinWeisbart opened this issue Sep 7, 2023 · 2 comments
Open

Provide prefix sizes #81

ErinWeisbart opened this issue Sep 7, 2023 · 2 comments
Labels

Comments

@ErinWeisbart
Copy link
Contributor

I think it would be helpful to provide a breakdown of data sizes by source/numerical data/image data so people have an idea of what they're getting into before downloading without having to list the bucket themselves.

I'm not sure how much is still in flux, but our dashboard auto-calculated these prefixes current as of right now. I'm happy to flesh out/update.

source images size (TB) workspace size (TB) workspace_dl size (TB) total size (TB)
1 13.2
2 7.6 10.8 21.6
3 16.6 20.6 42.5
4 17.6 17.3 39.1
5 13.1 32 7.4
6 11.7 25.8 43.7
7 14.9
8 7.2 12.1 24.4
9 9.2 17.8 7.1
10 7.5 11.3 21.6
11 10.3 21.6
13 15.8 6.8
@ErinWeisbart
Copy link
Contributor Author

(This is what's in cellpainting-gallery/cpg00016-jump)
I'm planning on providing the total size in the cellpainting-gallery README but I think a by-source breakdown belongs in this repo.

@shntnu shntnu added the cpg0016 label Dec 8, 2023
@ErinWeisbart
Copy link
Contributor Author

FYI when you're ready to add this to a new data release, these can now be quickly and easily calculated with https://github.com/broadinstitute/cpg

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants