-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Adding make_column_transformer #14
base: master
Are you sure you want to change the base?
Conversation
Fixed. |
My time is pretty limited at the moment but I really appreciate your
interest and help and will try to give this a careful look soon
|
No worries! Take your time. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks!
@@ -2,6 +2,7 @@ | |||
from collections import defaultdict as _defaultdict | |||
import itertools as _itertools | |||
|
|||
from sklearn.compose import ColumnTransformer as _ColumnTransformer |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Let's make this code scikit-learn 0.19-compatible by importing this in make_column_transformer
and skipping the corresponding tests in old versions.
* if a step has estimators of mixed type, the step is named 'alt' | ||
* if there are multiple steps of the same name using the above rules, | ||
a suffix '-1', '-2', etc. is added. | ||
""" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please add a usage example.
Gave this issue a shot: #13
ColumnTransformer breaks from Pipeline and FeatureUnion's pattern by accepting a list of (encoder, 'column_name') tuples, instead of just a list of encoders. As a result, one question for this PR is whether searchgrid should pass a transformer name to ColumnTransformer.transformer that only contains the encoder class name, or if it should also include the column name. To be consistent with make_pipeline and make_union, I chose only the encoder class name.
@jnothman I'd love any comments or suggestions!