[Do Not Review] Set splitsize for hadoop InputFormat to Presto max_split_size #23635
base: master
Conversation
Any way to add a test case for this?
Tests?
@@ -93,6 +94,7 @@
 import static java.lang.Math.max;
 import static java.lang.String.format;
 import static java.util.Objects.requireNonNull;
+import static org.apache.hadoop.mapreduce.lib.input.FileInputFormat.SPLIT_MINSIZE;
This feels like a large dependency to add just for a constant string. Consider using the literal value instead.
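A minimal sketch of what using the literal could look like. The string is the value of Hadoop's `FileInputFormat.SPLIT_MINSIZE`; the class name, the helper `withSplitMinSize`, and the use of a plain `Map` in place of the Hadoop configuration object are hypothetical and only keep the example free of Hadoop dependencies.

```java
import java.util.HashMap;
import java.util.Map;

final class SplitMinSizeLiteral {
    // Same value as org.apache.hadoop.mapreduce.lib.input.FileInputFormat.SPLIT_MINSIZE,
    // inlined so the static import is not needed.
    static final String SPLIT_MINSIZE = "mapreduce.input.fileinputformat.split.minsize";

    private SplitMinSizeLiteral() {}

    // Hypothetical helper: the Map stands in for a Hadoop Configuration / JobConf.
    // Returns a copy of the settings with the minimum split size set, in bytes.
    static Map<String, String> withSplitMinSize(Map<String, String> conf, long splitSizeBytes) {
        Map<String, String> copy = new HashMap<>(conf);
        copy.put(SPLIT_MINSIZE, Long.toString(splitSizeBytes));
        return copy;
    }

    public static void main(String[] args) {
        // 64 MiB is an illustrative value, not one taken from the PR.
        Map<String, String> conf = withSplitMinSize(new HashMap<>(), 64L * 1024 * 1024);
        System.out.println(SPLIT_MINSIZE + "=" + conf.get(SPLIT_MINSIZE));
    }
}
```

With a real `Configuration`, the equivalent call would be `conf.setLong(SPLIT_MINSIZE, splitSizeBytes)`.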
Sure, I will check on how we can add a test for this.
The tests added so far mainly check how split generation is affected when the Hadoop library is used directly.
Description
Set splitsize for hadoop InputFormat to Presto max_split_size
Details in #23608
Motivation and Context
Make the split size configurable wherever the Hadoop InputFormat library is used for split generation.
Resolves #23608
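For context, Hadoop's old-API `FileInputFormat.computeSplitSize` picks a split size as `max(minSize, min(goalSize, blockSize))`, so raising the minimum split size is one way to get splits larger than a block. The sketch below reproduces only that arithmetic with no Hadoop dependency. The class name and the byte values are illustrative, and treating Presto's `max_split_size` as the value fed into `minSize` is an assumption about how this change wires things up, not something confirmed here.

```java
final class SplitSizeSketch {
    private SplitSizeSketch() {}

    // Mirrors the formula used by org.apache.hadoop.mapred.FileInputFormat.computeSplitSize.
    static long computeSplitSize(long goalSize, long minSize, long blockSize) {
        return Math.max(minSize, Math.min(goalSize, blockSize));
    }

    public static void main(String[] args) {
        long mib = 1024L * 1024;
        long goalSize = 1024 * mib;   // illustrative: total input size / requested splits
        long blockSize = 128 * mib;   // illustrative block size

        // With the default minimum of 1 byte, the block size caps the split size.
        System.out.println(computeSplitSize(goalSize, 1, blockSize) / mib + " MiB");

        // With a 256 MiB minimum, the split size grows past the block size.
        System.out.println(computeSplitSize(goalSize, 256 * mib, blockSize) / mib + " MiB");
    }
}
```

The first call yields 128 MiB and the second 256 MiB, which is the effect a configurable split size is meant to expose.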
Impact
Make the split size configurable wherever the Hadoop InputFormat library is used for split generation.
Resolves #23608
Test Plan
Contributor checklist
Release Notes