-
Notifications
You must be signed in to change notification settings - Fork 92
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Closes #3219: Optimize old Parquet srting read code #3220
base: master
Are you sure you want to change the base?
Conversation
c63837c
to
73c4268
Compare
6881f7f
to
884b94e
Compare
@stress-tess I think this PR is finally ready to go. A similar pattern to this can be applied everywhere where we've been rolling back the batch optimization in favor of the single reads to accomodate null values. I am still gathering some performance numbers on various machines, but if we could get this into the release this week, that'd be great. |
2692c6c
to
e73043e
Compare
This will pass CI after #3312 is merged (and we rebase on top of it) |
3abef5b
to
290f1e3
Compare
After seeing more results gathered on different machines showing mixed results for the new Parquet string optimization, we have decided to make some changes and go back towards a simpler optimization that we are more confident in across the board optimizations, though, they may be more minor than the "best case" with the optimized version, but we shouldn't be seeing any cases get worse with this approach, which seems preferable.