feat: Support MPS during training and inference #3100

Open · wants to merge 24 commits into main
Conversation

@ori-kron-wis (Collaborator) commented on Dec 17, 2024:

Add support for Mac GPUs (M1/M2/M3) in scvi and restore MPS as the default option when running on a Mac (if available).

Testing was done manually on my Mac (M3), plus verification here that everything still works on CPU/CUDA.

References:
pytorch/pytorch#132605
pytorch/pytorch#77764
https://discourse.scverse.org/t/macbook-m1-m2-mps-acceleration-with-scvi/2075/7
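
For context (this sketch is not code from the PR): the auto-detection behavior described above can be expressed with PyTorch's public MPS availability check; the `pick_device` helper name is hypothetical.

```python
import torch

def pick_device() -> torch.device:
    """Hypothetical helper: prefer CUDA, then Apple's MPS backend, then CPU."""
    if torch.cuda.is_available():
        return torch.device("cuda")
    # True on Apple-silicon Macs running an MPS-enabled PyTorch build.
    if torch.backends.mps.is_available():
        return torch.device("mps")
    return torch.device("cpu")

device = pick_device()
```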

@ori-kron-wis added the labels cuda tests (Run test suite on CUDA) and on-merge: backport to 1.2.x on Dec 17, 2024
@ori-kron-wis added this to the scvi-tools 1.2 milestone on Dec 17, 2024
@ori-kron-wis self-assigned this on Dec 17, 2024

codecov bot commented Dec 17, 2024

Codecov Report

Attention: Patch coverage is 75.96154% with 25 lines in your changes missing coverage. Please review.

Project coverage is 83.20%. Comparing base (6ae39a4) to head (cc9a16e).
Report is 1 commit behind head on main.

| Files with missing lines | Patch % | Lines |
| --- | --- | --- |
| src/scvi/train/_trainingplans.py | 44.44% | 5 Missing ⚠️ |
| src/scvi/data/_preprocessing.py | 66.66% | 4 Missing ⚠️ |
| src/scvi/module/_autozivae.py | 63.63% | 4 Missing ⚠️ |
| src/scvi/distributions/_negative_binomial.py | 90.62% | 3 Missing ⚠️ |
| src/scvi/external/velovi/_module.py | 70.00% | 3 Missing ⚠️ |
| src/scvi/model/_utils.py | 40.00% | 3 Missing ⚠️ |
| src/scvi/external/decipher/_module.py | 66.66% | 1 Missing ⚠️ |
| src/scvi/model/base/_rnamixin.py | 91.66% | 1 Missing ⚠️ |
| src/scvi/nn/_base_components.py | 66.66% | 1 Missing ⚠️ |
Additional details and impacted files
```diff
@@            Coverage Diff             @@
##             main    #3100      +/-   ##
==========================================
- Coverage   87.67%   83.20%   -4.48%
==========================================
  Files         180      180
  Lines       15187    15227      +40
==========================================
- Hits        13315    12669     -646
- Misses       1872     2558     +686
```
| Files with missing lines | Coverage Δ |
| --- | --- |
| src/scvi/external/cellassign/_module.py | 97.24% <100.00%> (ø) |
| src/scvi/model/_totalvi.py | 87.29% <100.00%> (ø) |
| src/scvi/module/_mrdeconv.py | 95.13% <100.00%> (ø) |
| src/scvi/module/_vae.py | 94.92% <100.00%> (ø) |
| src/scvi/external/decipher/_module.py | 98.76% <66.66%> (-1.24%) ⬇️ |
| src/scvi/model/base/_rnamixin.py | 94.46% <91.66%> (ø) |
| src/scvi/nn/_base_components.py | 94.77% <66.66%> (-0.34%) ⬇️ |
| src/scvi/distributions/_negative_binomial.py | 83.26% <90.62%> (-0.65%) ⬇️ |
| src/scvi/external/velovi/_module.py | 81.36% <70.00%> (+0.17%) ⬆️ |
| src/scvi/model/_utils.py | 88.74% <40.00%> (-3.21%) ⬇️ |

... and 3 more

... and 9 files with indirect coverage changes

@canergen (Member) commented:
Please enable the Mac runner for MPS-related changes.

@ori-kron-wis marked this pull request as ready for review on December 18, 2024, 15:57
@ori-kron-wis (Collaborator, Author) commented:

A comparison of training times on CPU, MPS (M3), and an NVIDIA RTX 6000 Ada (48 GB GDDR6):
[image: benchmark comparison]

@canergen (Member) commented:

The difference was much bigger in my hands when increasing the batch size (a larger batch size creates larger matrix multiplications, where MPS is more efficient than the CPU) and enabling compilation. It was actually faster than an A100 on Google Colab then. We should optimize this a bit.
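
For illustration (not code from the PR): a minimal sketch of the larger-batch configuration being described, assuming scvi-tools' standard `train()` signature, where `accelerator` is forwarded to the Lightning trainer and `batch_size` controls the data loaders; exact names may differ per release.

```python
import numpy as np
import anndata as ad
import scvi

# Toy data so the sketch is self-contained (random counts, 2000 cells x 200 genes).
adata = ad.AnnData(np.random.poisson(1.0, size=(2000, 200)).astype(np.float32))
scvi.model.SCVI.setup_anndata(adata)
model = scvi.model.SCVI(adata)

# Larger batches mean larger matrix multiplications, which is where MPS
# pulls ahead of the CPU; scvi-tools' default batch_size is 128.
model.train(
    max_epochs=10,
    accelerator="mps",  # requires an Apple-silicon Mac and an MPS-enabled PyTorch build
    batch_size=1024,    # illustrative value; tune for available memory
)
```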

A review thread was opened on this diff excerpt:

```python
    np.asarray(1.0 - (data > 0).sum(0) / data.shape[0]).ravel()
).to(device)
# On MPS we need to cast to float32 first, as the MPS framework doesn't support float64.
if device.type == "mps":
```
A reviewer (Member) replied:

We could also do it for other devices. Float32 is sufficient for all computations.
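
A sketch of the device-agnostic variant being suggested here, casting to float32 unconditionally instead of gating on `device.type == "mps"` (the surrounding names are illustrative, not the PR's actual code):

```python
import numpy as np
import torch

device = torch.device("mps" if torch.backends.mps.is_available() else "cpu")

# Illustrative stand-in for the count matrix from the snippet above.
data = np.random.poisson(0.5, size=(100, 20))

# Casting to float32 unconditionally works on every backend (CPU, CUDA, MPS)
# and is sufficient precision here; MPS merely makes it mandatory, since the
# backend has no float64 support.
dropout_rate = torch.from_numpy(
    np.asarray(1.0 - (data > 0).sum(0) / data.shape[0]).ravel()
).float().to(device)
```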

@ori-kron-wis removed the mps label on Dec 22, 2024
@canergen changed the title from "feat: Support MPS during training" to "feat: Support MPS during training and inference" on Dec 22, 2024
Labels: cuda tests (Run test suite on CUDA), on-merge: backport to 1.2.x
Projects: None yet
Participants: 2