Enhance PCA Decomposition #476

bbengfort · 2018-06-14T14:33:05Z

We should enhance the PCADecomposition visualizer to provide many of the features the Manifold visualizer provides, including things like:

Color points by class with a legend (See Add legend option to PCA visualizer #458)
Color points by heatmap for continuous y and add a colorbar
Add alpha parameter (see Add alpha support for scatter plots #475)
Add random state to pass to PCA
Allow user to pass in a PCA transformer/pipeline
Update tests with better random data sets (more points; see manifold tests)
Include explained variance/noise variance (or explained variance ratio) in chart
Enhance biplots documentation

See also #455 as another enhancement that might not be related to this enhancement.

rohit-ganapathy · 2019-02-18T07:29:20Z

Hey! I'm interested in tackling this.

bbengfort · 2019-02-18T15:28:45Z

@rohit-ganapathy - that would be great, feel free to open a PR when you're ready for us to take a look.

dnabanita7 · 2019-02-19T00:36:47Z

Can I start working on this, even if @rohit-ganapathy is assigned?

rebeccabilbro · 2019-02-19T02:18:33Z

Hello @naba7 — as we explained last week in response to your questions on #738 and #677, we do not "assign" issues or reserve issues for contributors. Anyone is welcome to submit a PR for a feature or bugfix they work on.

However, given that you already have one PR open that still needs to be completed (#755), have started working on #615, and are new to working on Yellowbrick and still getting to know our API, we would really appreciate it if you would focus on getting those first PRs across the finish line before starting anything new.

We appreciate your enthusiasm about contributing to Yellowbrick. One of the most important lessons to learn is that open source is a marathon, not a sprint, so we hope you can be patient and enjoy the journey — we promise Yellowbrick isn't going away!

dnabanita7 · 2019-02-19T02:23:42Z

It's so exciting and fun. I want to learn things quickly, so I have been asking to get assigned everywhere. Sorry, I have noted this now.

bbengfort · 2019-08-29T00:02:52Z

@naresh-bachwani has this issue been fixed by your work this summer?

naresh-bachwani · 2019-08-31T16:03:35Z

@bbengfort I think only the explained variance charts are left! But that will be covered in decomposition, right?

bbengfort · 2019-09-04T17:40:15Z

@naresh-bachwani ExplainedVariance is separate to this issue. Would you mind ticking the checkboxes above based on your work?

BradKML · 2022-03-12T10:29:34Z

Can these functions be applied to FastICA in Scikit-Learn (or maybe any ICA)?
Also observing #615 and #316

bbengfort · 2022-03-12T13:53:54Z

@BrandonKMLee very possibly; it wouldn't hurt to try. I think you'd have to change the pca_transformer attribute on the PCA visualizer, setting it to a pipeline similar to the code here: https://github.com/DistrictDataLabs/yellowbrick/blob/develop/yellowbrick/features/pca.py#L184-L189. This would have to be done after initialization but before any call to fit or transform. I don't see anywhere it wouldn't work, unless FastICA or ICA lacks required attributes like n_components_.

You could also try passing an initialized FastICA or ICA transformer as the manifold attribute to the Manifold visualizer - this might not give you the same features as ICA, but should give you the projected visualization.
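
A minimal sketch of the pipeline idea described above, using only scikit-learn. The data is synthetic, the step names "scale" and "ica" are chosen for illustration, and the Yellowbrick assignment is left as a comment because it is untested here; whether the visualizer accepts such a pipeline is exactly the open question in this thread.

```python
import numpy as np
from sklearn.decomposition import FastICA
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic non-Gaussian data, purely illustrative.
rng = np.random.RandomState(42)
X = rng.laplace(size=(200, 5))

# A scale-then-decompose pipeline with FastICA in place of PCA.
ica_pipeline = Pipeline([
    ("scale", StandardScaler()),
    ("ica", FastICA(n_components=2, random_state=42, max_iter=1000)),
])

# Project the data down to two independent components.
X_proj = ica_pipeline.fit_transform(X)
print(X_proj.shape)

# Hypothetical and untested: per the suggestion above, the pipeline would be
# assigned after the visualizer is constructed and before fit/transform:
#   viz.pca_transformer = ica_pipeline
```

If the projection itself is all that is needed, X_proj can be plotted directly with any scatter plot routine.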

BradKML · 2022-03-12T14:28:18Z

@bbengfort n_components_in_ for FastICA, but at the same time explained variance could be a problem, as the components are expected to have evenly distributed significance rather than being ordered, and no such function currently exists for FastICA.
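
One way to check which fitted attributes each estimator actually exposes is to fit both on the same data and probe with hasattr. This is a sketch on synthetic data; the attribute list is an assumption about what a PCA-style visualizer might read, not a statement of what Yellowbrick requires.

```python
import numpy as np
from sklearn.decomposition import PCA, FastICA

# Synthetic non-Gaussian data, purely illustrative.
rng = np.random.RandomState(0)
X = rng.laplace(size=(150, 4))

pca = PCA(n_components=2, random_state=0).fit(X)
ica = FastICA(n_components=2, random_state=0, max_iter=1000).fit(X)

# Attributes a PCA-oriented visualizer might rely on (assumed list).
attrs = [
    "n_components_",
    "explained_variance_",
    "explained_variance_ratio_",
    "noise_variance_",
]

# Map each attribute name to (available on PCA, available on FastICA).
support = {name: (hasattr(pca, name), hasattr(ica, name)) for name in attrs}

for name, (on_pca, on_ica) in support.items():
    print(f"{name}: PCA={on_pca}, FastICA={on_ica}")
```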

bbengfort · 2022-05-21T18:35:01Z

@BrandonKMLee OK, that makes sense, so FastICA may not work unless we create a specialized manifold for it.

bbengfort added the labels type: feature (a new visualizer or utility for yb), priority: medium (can wait until after next release), and level: intermediate (python coding expertise required) on Jun 14, 2018

bbengfort assigned rebeccabilbro and bbengfort Jun 18, 2018

bbengfort mentioned this issue Sep 17, 2018

Extend PCA Visualizer with Component-Feature Strength #615

Closed

wagner2010 added the GSoC-2019 label Jan 23, 2019

wagner2010 removed the GSoC-2019 label Feb 2, 2019

rebeccabilbro mentioned this issue Feb 19, 2019

Add legend option to PCA visualizer #458

Closed

rohit-ganapathy mentioned this issue Feb 19, 2019

Enhance PCA Decomposition #476 #763

Closed

lwgray mentioned this issue Jul 10, 2019

Point number limit in PCA decomposition #920

Closed

bbengfort mentioned this issue Jul 17, 2019

Extend DataVisualizer and Update DataVisualizer Subclasses #927

Merged

10 tasks

rebeccabilbro mentioned this issue Jul 22, 2019

ProjectionVisualizer: unifying functionality of PCA and Manifold #874

Closed

bbengfort closed this as completed Aug 29, 2019

bbengfort reopened this Aug 29, 2019

gregparkes mentioned this issue Apr 22, 2022

Include explained variance in PCA component plot #1239

Closed

9 tasks
