New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

OptimizedLinear implementation #5355

Merged

loadams merged 13 commits into microsoft:master from Snowflake-Labs:optim-linear

Apr 23, 2024

Collaborator

jeffra commented Apr 2, 2024

Optimized version of nn.Linear that adds features such as:
* LoRA w. base weight sharding
* FP [6,8,12] quantization

Depends on #5336 being merged first

Co-authored-by: @rajhans
Co-authored-by: @aurickq


          optimized linear + tests

97a230e

jeffra requested review from mrwyattii, tjruwase and loadams as code owners

April 2, 2024 19:14

jeffra and others added 4 commits

April 4, 2024 14:57


          Merge branch 'master' into optim-linear

7afae40


          some fixes to make lora training work

1f7006a


          clean-up

78e763d


          Merge branch 'master' into optim-linear

fa0b032

sfc-gh-jrasley mentioned this pull request

some fixes to make training work Snowflake-Labs/DeepSpeed#5

Open

tjruwase reviewed

View reviewed changes

deepspeed/linear/optimized_linear.py Show resolved Hide resolved

tjruwase reviewed

View reviewed changes

deepspeed/linear/optimized_linear.py Outdated Show resolved Hide resolved

tjruwase reviewed

View reviewed changes

deepspeed/linear/optimized_linear.py Show resolved Hide resolved

tjruwase reviewed

View reviewed changes

deepspeed/linear/optimized_linear.py Outdated Show resolved Hide resolved

tjruwase reviewed

View reviewed changes

deepspeed/linear/optimized_linear.py Outdated Show resolved Hide resolved

tjruwase reviewed

View reviewed changes

deepspeed/linear/config.py Show resolved Hide resolved

tjruwase reviewed

View reviewed changes

deepspeed/linear/config.py Outdated Show resolved Hide resolved

tjruwase reviewed

View reviewed changes

deepspeed/linear/quantization.py Outdated Show resolved Hide resolved

tjruwase reviewed

View reviewed changes

tests/unit/linear/test_linear.py Outdated Show resolved Hide resolved

tjruwase reviewed

View reviewed changes

tests/unit/linear/test_linear.py Outdated Show resolved Hide resolved

tjruwase reviewed

View reviewed changes

tests/unit/linear/test_linear.py Outdated Show resolved Hide resolved

tjruwase reviewed

View reviewed changes

deepspeed/linear/quantization.py Show resolved Hide resolved

jeffra and others added 2 commits

April 19, 2024 14:42


          Merge branch 'master' into optim-linear

16d72d4


          address comments, new tests, new fixes

980db87

jeffra requested review from awan-10 and arashb as code owners

April 20, 2024 06:00

jeffra and others added 4 commits

April 19, 2024 23:08


          Merge branch 'master' into optim-linear

ffe2223


          add type check for configs

185a68f


          formatting

dc9258c


          Merge branch 'master' into optim-linear

fce174b

tjruwase approved these changes

View reviewed changes

tjruwase added this pull request to the merge queue

github-merge-queue bot removed this pull request from the merge queue due to failed status checks

loadams added this pull request to the merge queue

github-merge-queue bot removed this pull request from the merge queue due to failed status checks

loadams added this pull request to the merge queue

github-merge-queue bot removed this pull request from the merge queue due to failed status checks

jeffra and others added 2 commits

April 23, 2024 09:36


          Merge branch 'master' into optim-linear

f2cbcae


          loosen typechecking of config

efeed39

loadams merged commit 5e6c9b9 into microsoft:master

10 of 12 checks passed

loadams added a commit that referenced this pull request


          Revert "OptimizedLinear implementation (#5355)"

4ab4707

This reverts commit 5e6c9b9.

rraminen pushed a commit to ROCm/DeepSpeed that referenced this pull request


          OptimizedLinear implementation (microsoft#5355)

29726dc

Optimized version of `nn.Linear` that adds features such as:
      * LoRA w. base weight sharding
      * FP [6,8,12] quantization

Depends on microsoft#5336 being merged first

Co-authored-by: @rajhans
Co-authored-by: @aurickq

---------

Co-authored-by: Rajhans Samdani <[email protected]>
Co-authored-by: Jeff Rasley <[email protected]>

umchand pushed a commit to umchand/DeepSpeed that referenced this pull request


          OptimizedLinear implementation (microsoft#5355)

3ba4c01

Optimized version of `nn.Linear` that adds features such as:
      * LoRA w. base weight sharding
      * FP [6,8,12] quantization

Depends on microsoft#5336 being merged first

Co-authored-by: @rajhans
Co-authored-by: @aurickq

---------

Co-authored-by: Rajhans Samdani <[email protected]>
Co-authored-by: Jeff Rasley <[email protected]>

dbyoung18 pushed a commit to dbyoung18/DeepSpeed that referenced this pull request


          OptimizedLinear implementation (microsoft#5355)

7147d19

Optimized version of `nn.Linear` that adds features such as:
      * LoRA w. base weight sharding
      * FP [6,8,12] quantization

Depends on microsoft#5336 being merged first

Co-authored-by: @rajhans
Co-authored-by: @aurickq

---------

Co-authored-by: Rajhans Samdani <[email protected]>
Co-authored-by: Jeff Rasley <[email protected]>

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Reviewers

tjruwase tjruwase approved these changes

mrwyattii Awaiting requested review from mrwyattii

loadams Awaiting requested review from loadams loadams is a code owner

awan-10 Awaiting requested review from awan-10

arashb Awaiting requested review from arashb

Labels

None yet