-
-
Notifications
You must be signed in to change notification settings - Fork 559
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Point number limit in PCA decomposition #920
Comments
hi @hugo-pires Thank you for using Yellowbrick. I don’t think there is an upper-limit but i will verify this with my colleagues. Are you having problems with the PCADecomposition visualizer? |
Thank you @lwgray and congratulations for your work. The questions is that I am projecting an aprox 5000 examples dataframe, but I'm getting an aprox 50 points. Probably they are overlaid, but I would like to doublecheck. Or use some alpha parameter. |
Hi @hugo-pires — thanks for checking out Yellowbrick! There probably is an upper limit for PCA, but 5k samples isn't even close to it! So I agree it's likely that the points are merely being overlaid. You can test this out using the from sklearn.datasets import make_classification
from yellowbrick.features.pca import PCADecomposition
X, y = make_classification(
n_samples=5000, n_features=200, n_informative=2, n_redundant=2
)
visualizer = PCADecomposition(alpha=.25)
visualizer.fit_transform(X, y)
visualizer.poof() The default |
@hugo-pires @rebeccabilbro is absolutely right that you should be able to plot 5000 points and her suggestion on altering alpha is right on. I wanted to add a few things.
|
Thank you @lwgray and @rebeccabilbro . It would also be nice to have auto colors to discrete targets, based on the levels of the label. |
@hugo-pires i think that #476 and #874 will address that request. |
Hello
Is there any limit to the number of points in PCADecomposition plot?
Thank you
The text was updated successfully, but these errors were encountered: