Hi all, I have a question. I split my data with scikit-learn's train_test_split, and then removed outliers from the training set only, using the percentile method. As a result, x_train now has fewer rows than before, while y_train still has its original size. Even though x_train and y_train have different sizes, my model still gets trained. Isn't it wrong to fit the model on this data? If not, how does the model account for the change?
Replies: 1 comment 2 replies
Yes, if you remove outliers from the train set using the Outlier trimmer, then you need to remove those observations from y_train as well.
You can do so by looking at the X_train and y_train indices: keep only the rows of y_train whose index is still present in X_train after trimming.
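
For illustration, here is a minimal sketch of that index-based approach. It assumes X_train is a pandas DataFrame and y_train is a pandas Series that share the same index labels, which is the case when pandas objects are passed to train_test_split. The toy data, the column name `x`, and the 1st/99th percentile cut-offs are made up for the example, and the trimming is done with plain pandas rather than the Outlier trimmer.

```python
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split

# Hypothetical toy data: one feature "x" with two injected extreme values.
rng = np.random.default_rng(0)
X = pd.DataFrame({"x": rng.normal(size=200)})
X.loc[[3, 50], "x"] = [25.0, -30.0]
y = pd.Series(rng.integers(0, 2, size=200), name="target")

# With pandas inputs, the split keeps the original index labels.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0
)

# Percentile-based trimming on the training set only
# (the 1st and 99th percentiles are assumed cut-offs).
lower, upper = X_train["x"].quantile([0.01, 0.99])
mask = X_train["x"].between(lower, upper)
X_train_trimmed = X_train[mask]

# Keep only the target rows whose index labels survived the trimming.
y_train_trimmed = y_train.loc[X_train_trimmed.index]

print(len(X_train), len(X_train_trimmed), len(y_train_trimmed))
```

After this step, X_train_trimmed and y_train_trimmed have the same length and the same index, so they can be passed to the model's fit method together. The test set is left untouched.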