-
Notifications
You must be signed in to change notification settings - Fork 99
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
2113 using compact incomplete on a library with dynamic schema with a named index can result in an unreadable index #2116
base: master
Are you sure you want to change the base?
Conversation
auto new_df_field_index_count = new_df_descriptor.index().type() == IndexDescriptor::Type::EMPTY ? 0 : new_df_descriptor.index().field_count(); | ||
|
||
// If either index is empty, we consider them to match | ||
if (df_in_store_index_field_count == 0 || new_df_field_index_count == 0) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This check is to accommodate the existing behavior around empty DFs and Series, both of which have essentially empty indexes, even though for series the types is RowCount, I think
The benchmarks are reporting ~30% performance degradation on the FinalizeStagedData benchmarks:
I think that this is due to the new check over all of the segments to make sure that the index names are the same, which was not done before. IGNORE THIS: The latest commit fixes this - 300ae92 |
@@ -86,6 +86,31 @@ inline void check_normalization_index_match( | |||
} | |||
} | |||
|
|||
inline bool index_names_match( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we add schema_checks.cpp and add this there?
I think it's worth adding a test for |
Reference Issues/PRs
Fixes #2113
What does this implement or fix?
Any other comments?
Checklist
Checklist for code changes...