-
I am accessing a 3-column parquet file using
Firstly, are these gets buffered elsewhere, or does this result in three round trips? If so, is there a way to collapse these requests into a single one? Most requests would typically fall within a single row group, so it would also work to read the whole row group in this case. |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 3 replies
-
If using https://docs.rs/object_store/latest/object_store/trait.ObjectStore.html#method.get_ranges which is what https://docs.rs/parquet/latest/parquet/arrow/async_reader/struct.ParquetObjectReader.html does for each row group, then it will coalesce gaps smaller than https://docs.rs/object_store/latest/object_store/constant.OBJECT_STORE_COALESCE_DEFAULT.html |
Beta Was this translation helpful? Give feedback.
-
Thank you, that helps. I have used the
The If this is the case, do you think it might be better to have smaller row groups, and filter based on row groups rather than using |
Beta Was this translation helpful? Give feedback.
Yes, you will want to use those in combination