ENH: SPMD interface for IncrementalPCA #1979

olegkkruglov · 2024-07-31T00:02:27Z

Description

Added SPMD interface for IncrementalPCA
Changed the policy-saving workflow: the queue is now saved to attributes instead of the policy. This is necessary because, on the oneDAL side, finalize_fit requires spmd_policy while partial_fit requires data_parallel_policy.
finalize_fit now uses the provided queue for computations on the onedal4py side.
Contains some content from TEST: test coverage for sklearnex SPMD ifaces #1777 for the test implementation.
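
The queue-handling change described above can be pictured with a small, library-free sketch. Everything in it is hypothetical: `DataParallelPolicy`, `SpmdPolicy`, and `IncrementalEstimatorSketch` are stand-ins written for illustration and are not the onedal4py classes. It only shows the idea of storing the queue on the estimator and building the appropriate policy at each step.

```python
class DataParallelPolicy:
    """Hypothetical stand-in for a single-device policy."""

    def __init__(self, queue):
        self.queue = queue


class SpmdPolicy:
    """Hypothetical stand-in for a distributed (SPMD) policy."""

    def __init__(self, queue):
        self.queue = queue


class IncrementalEstimatorSketch:
    """Illustrative only: keeps the queue, not a policy, as state."""

    def __init__(self):
        self._queue = None
        self.policies_used = []

    def partial_fit(self, batch, queue=None):
        # Refresh the stored queue on every call so finalize_fit
        # sees the most recent one.
        self._queue = queue
        # The per-batch step builds a data-parallel policy.
        policy = DataParallelPolicy(queue)
        self.policies_used.append(type(policy).__name__)
        return self

    def finalize_fit(self):
        # The finalization step needs a different policy type, so it is
        # constructed here from the stored queue.
        policy = SpmdPolicy(self._queue)
        self.policies_used.append(type(policy).__name__)
        return policy.queue


est = IncrementalEstimatorSketch()
est.partial_fit([1, 2], queue="q0").partial_fit([3], queue="q1")
final_queue = est.finalize_fit()
```

Storing a single policy object would not work in this picture, because the two steps need different policy types; storing the queue lets each step construct its own.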

Checklist to comply with before moving PR from draft:

PR completeness and readability

I have reviewed my changes thoroughly before submitting this pull request.
I have commented my code, particularly in hard-to-understand areas.
I have updated the documentation to reflect the changes or created a separate PR with update and provided its number in the description, if necessary.
Git commit message contains an appropriate signed-off-by string (see CONTRIBUTING.md for details).
I have added a respective label(s) to PR if I have a permission for that.
I have resolved any merge conflicts that might occur with the base branch.

Testing

The unit tests pass successfully.
I have run it locally and tested the changes extensively.

Performance

I have measured performance for affected algorithms using scikit-learn_bench and provided at least summary table with measured data, if performance change is expected.
I have provided justification why performance has changed or why changes are not expected.

samir-nasibli · 2024-08-05T12:37:05Z

@olegkkruglov please rebase your branch

ethanglaser · 2024-08-20T00:54:27Z

/intelci: run

onedal/spmd/decomposition/incremental_pca.py

ethanglaser · 2024-08-20T18:47:07Z

https://intel-ci.intel.com/ef5f226d-b65f-f196-993d-a4bf010d0e2e

olegkkruglov · 2024-08-26T13:46:20Z

/intelci: run

samir-nasibli · 2024-08-30T06:11:41Z

@olegkkruglov All CI checks should be run on the PR before it is marked ready for review. For SPMD algorithms, an internal CI run is mandatory.

onedal/decomposition/incremental_pca.py

onedal/spmd/decomposition/incremental_pca.py

samir-nasibli · 2024-08-30T06:21:35Z

/intelci: run

olegkkruglov · 2024-08-30T12:15:26Z

/intelci: run

olegkkruglov · 2024-08-30T13:23:16Z

https://intel-ci.intel.com/ef66d2e7-6006-f173-ab00-a4bf010d0e2e

olegkkruglov · 2024-09-02T12:08:26Z

> @olegkkruglov Is it possible to update and use the global queue in scikit-learn-intelex for finalize_fit? This could be done as a follow-up for all incremental algorithms, just to avoid storing the queue in the models themselves.

It is probably possible, but what's the point? The serialization problem is already resolved, and I don't see any other problems with it.
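
For context on why storing a queue on a model can interact with serialization, here is a minimal sketch of one common way to keep a non-picklable handle out of a pickled object. It is an assumption-laden illustration, not a description of how this project resolved the issue: `FakeQueue` and `ModelSketch` are hypothetical names, and only the standard-library `pickle` module is used.

```python
import pickle


class FakeQueue:
    """Hypothetical stand-in for a device queue that cannot be pickled."""

    def __reduce__(self):
        raise TypeError("FakeQueue cannot be pickled")


class ModelSketch:
    """Illustrative only: stores a queue but drops it when pickled."""

    def __init__(self):
        self._queue = None
        self.n_samples_seen_ = 0

    def partial_fit(self, batch, queue=None):
        self._queue = queue
        self.n_samples_seen_ += len(batch)
        return self

    def __getstate__(self):
        # Copy the instance state and blank out the queue so that
        # pickling never touches the non-serializable handle.
        state = self.__dict__.copy()
        state["_queue"] = None
        return state


model = ModelSketch().partial_fit([1, 2, 3], queue=FakeQueue())
restored = pickle.loads(pickle.dumps(model))
```

The restored object keeps its fitted state but has no queue, so a caller would need to supply one again before further device computation.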

onedal/spmd/decomposition/incremental_pca.py

icfaust

I can't comment much on the implementation of the SPMD object side of things, since I didn't review the initial IncrementalPCA PR. My focus was on issues similar to those in the other Incremental SPMD PRs for 2025/ testing. I assume the _create_model method comes from that implementation and that it was justified for sklearn conformance reasons.

sklearnex/spmd/decomposition/tests/test_incremental_pca_spmd.py

samir-nasibli

Please rebase your branch and run CI

onedal/spmd/decomposition/incremental_pca.py

icfaust · 2024-09-04T08:21:14Z

@olegkkruglov ping me when you want another review.

samir-nasibli

Thank you! Awaiting the other reviewers' approvals, assuming green CI.

olegkkruglov · 2024-09-05T11:56:54Z

/intelci: run

icfaust

Merge whenever you are ready.

sklearnex/spmd/decomposition/incremental_pca.py

ethanglaser · 2024-09-05T15:17:14Z

sklearnex/spmd/decomposition/tests/test_incremental_pca_spmd.py

Interesting that the SPMD IncrementalPCA tests are passing but the regular SPMD PCA tests are not.

olegkkruglov requested review from samir-nasibli and Alexsandruss as code owners July 31, 2024 00:02

olegkkruglov added the enhancement and testing labels Jul 31, 2024

olegkkruglov requested review from icfaust and ethanglaser July 31, 2024 00:03

olegkkruglov force-pushed the incpca-spmd branch from 5b2c92a to cff4aac Compare August 19, 2024 15:34

ethanglaser reviewed Aug 20, 2024

onedal/spmd/decomposition/incremental_pca.py

uxlfoundation deleted a comment from olegkkruglov Aug 20, 2024

olegkkruglov force-pushed the incpca-spmd branch from cff4aac to 091ad43 Compare August 26, 2024 13:45

icfaust mentioned this pull request Aug 27, 2024 in ENH: SPMD interface for IncrementalEmpiricalCovariance #1941 (merged)

olegkkruglov force-pushed the incpca-spmd branch from 60755b6 to 3ffcc6f Compare August 28, 2024 16:47

samir-nasibli reviewed Aug 30, 2024

olegkkruglov force-pushed the incpca-spmd branch from 3f9d836 to 5438c89 Compare August 30, 2024 12:15

olegkkruglov requested review from samir-nasibli and ethanglaser September 3, 2024 09:23

samir-nasibli reviewed Sep 3, 2024

onedal/spmd/decomposition/incremental_pca.py

icfaust reviewed Sep 3, 2024

icfaust requested a review from samir-nasibli September 3, 2024 13:16

samir-nasibli reviewed Sep 3, 2024

onedal/spmd/decomposition/incremental_pca.py (outdated)

onedal/spmd/decomposition/incremental_pca.py

olegkkruglov force-pushed the incpca-spmd branch from ce745a8 to b751aa2 Compare September 3, 2024 18:23

samir-nasibli approved these changes Sep 4, 2024

olegkkruglov added 11 commits September 5, 2024 02:33

Add distributed IncrementalPCA

2efc428

rename test file

a56830e

Fix n_components choice and fix tests

8ab5f25

Remove support_usm_ndarray

668fb7e

Remove unused import

6747eb0

Rename class reference

9b6ebd3

Update self._queue in every partial_fit call

8a718da

Fix docstring for partial_fit method at onedal part

5edb5c8

Change naming for base class reference

ca7683c

Address comment

6185b15

Address comments

6339b30

olegkkruglov force-pushed the incpca-spmd branch from f1a6c77 to 9c9edcd Compare September 5, 2024 09:35

Add comment to test

3040971

olegkkruglov force-pushed the incpca-spmd branch from 9c9edcd to 3040971 Compare September 5, 2024 09:36

Update test

0fcd91e

icfaust approved these changes Sep 5, 2024

ethanglaser reviewed Sep 5, 2024

sklearnex/spmd/decomposition/incremental_pca.py (outdated)

ethanglaser reviewed Sep 5, 2024

Fix docstrings

e0e3587

ethanglaser approved these changes Sep 5, 2024

olegkkruglov merged commit 9f63db2 into uxlfoundation:main Sep 5, 2024
9 of 11 checks passed
