-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Enable StringView by default by passing CI #11862
Conversation
🤯 |
e114800
to
dc075c3
Compare
update here that we now pass |
yay finally gets a green light 🎉 |
@@ -267,7 +267,9 @@ pub(crate) fn convert_schema_to_types(columns: &Fields) -> Vec<DFColumnType> { | |||
| DataType::Float64 | |||
| DataType::Decimal128(_, _) | |||
| DataType::Decimal256(_, _) => DFColumnType::Float, | |||
DataType::Utf8 | DataType::LargeUtf8 => DFColumnType::Text, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
pulled into #12033
I have incorporated the changes related to the arrow upgrade into #12032 There are still changes in this PR related to stringview in statistics / parquet files |
@@ -514,10 +521,13 @@ pub fn statistics_from_parquet_meta_calc( | |||
statistics.total_byte_size = Precision::Exact(total_byte_size); | |||
|
|||
let file_metadata = metadata.file_metadata(); | |||
let file_schema = parquet_to_arrow_schema( | |||
let mut file_schema = parquet_to_arrow_schema( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I messed around with this and I agree this is the least invasive change for supporting string statistics
I feel like there must be a better way, however. I will think about it
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I feel like there must be a better way, however.
I agree. I don't like passing the config around either...
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Follow up ticket filed #12123
Superceded by #12092 |
Which issue does this PR close?
Part of #11752 and #11682
Rationale for this change
I want to make sure the remaining work to stablize
StringView
is straighforward, so I make a branch to see what it looks like with the next arrow release, and also test the performancew with it.So this PR is not meant to be merged directly, but to help people (if not me) file smaller PRs when that day (arrow release) comes
What changes are included in this PR?
Are these changes tested?
Are there any user-facing changes?