Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closes #3219: Optimize old Parquet srting read code #3220

Draft
wants to merge 15 commits into
base: master
Choose a base branch
from

Conversation

bmcdonald3
Copy link
Contributor

After seeing more results gathered on different machines showing mixed results for the new Parquet string optimization, we have decided to make some changes and go back towards a simpler optimization that we are more confident in across the board optimizations, though, they may be more minor than the "best case" with the optimized version, but we shouldn't be seeing any cases get worse with this approach, which seems preferable.

@bmcdonald3 bmcdonald3 force-pushed the string-reg-opt branch 3 times, most recently from 6881f7f to 884b94e Compare June 10, 2024 16:10
@bmcdonald3
Copy link
Contributor Author

@stress-tess I think this PR is finally ready to go. A similar pattern to this can be applied everywhere where we've been rolling back the batch optimization in favor of the single reads to accomodate null values.

I am still gathering some performance numbers on various machines, but if we could get this into the release this week, that'd be great.

@bmcdonald3
Copy link
Contributor Author

bmcdonald3 commented Jun 11, 2024

This will pass CI after #3312 is merged (and we rebase on top of it)

@bmcdonald3 bmcdonald3 marked this pull request as draft June 14, 2024 18:06
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants