HDF5 settings: chunking, compression etc. #44

Open
4 tasks
gipert opened this issue May 4, 2024 · 1 comment
Labels
performance Computing performance

Comments

@gipert
Member

gipert commented May 4, 2024

  • Decide whether to keep using our custom waveform compressors or switch to a standard HDF5 filter (e.g. GZIP)
  • Decide on the HDF5 filter for the higher tiers: GZIP is old and slow; can we use SZIP (supported since HDF5 v1.6)?
  • Should we run bit shuffle before compression?
  • What is the optimal chunk size for higher-tier data?
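
To make the shuffle question concrete, here is a minimal standard-library sketch that compares DEFLATE (the algorithm behind the HDF5 GZIP filter) with and without a shuffle pass. Several things in it are assumptions and not taken from our data or config: the synthetic waveform from `make_waveform`, the compression level 4, and the helper names. It also uses a byte-level shuffle, which is simpler than a true bit shuffle, so treat it as an illustration of the idea and not as a benchmark.

```python
import random
import struct
import zlib


def make_waveform(n_samples=4096, baseline=15000, seed=0):
    """Hypothetical 16-bit waveform: noisy baseline with a step halfway through."""
    rng = random.Random(seed)
    samples = []
    for i in range(n_samples):
        step = 2000 if i > n_samples // 2 else 0
        samples.append(baseline + step + rng.randint(-8, 8))
    return samples


def byte_shuffle(raw, itemsize):
    """Group the k-th byte of every element together (byte shuffle, not bit shuffle)."""
    return b"".join(raw[k::itemsize] for k in range(itemsize))


def byte_unshuffle(shuffled, itemsize):
    """Invert byte_shuffle so the original element layout is restored."""
    n = len(shuffled) // itemsize
    out = bytearray(len(shuffled))
    for k in range(itemsize):
        out[k::itemsize] = shuffled[k * n:(k + 1) * n]
    return bytes(out)


samples = make_waveform()
# Little-endian unsigned 16-bit samples, as one contiguous buffer.
raw = struct.pack(f"<{len(samples)}H", *samples)

# Level 4 is an arbitrary choice for this sketch.
plain = zlib.compress(raw, 4)
shuffled = zlib.compress(byte_shuffle(raw, 2), 4)

print("raw bytes:          ", len(raw))
print("deflate only:       ", len(plain))
print("shuffle + deflate:  ", len(shuffled))
```

In h5py the corresponding dataset options are `compression="gzip"` together with `shuffle=True`; the same comparison should be repeated on real tier files and for several chunk shapes before deciding anything.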

Current raw tier HDF5 settings: https://github.com/legend-exp/legend-dataflow-config/tree/main/tier_raw

CC @lvarriano

@gipert added the performance (Computing performance) label on Nov 9, 2024