Releases: kozistr/pytorch_optimizer

pytorch-optimizer v2.6.1

22 Apr 12:14
be0351d

Change Log

Fix

  • fix variables not being located on the same device as the gradients, #132 (related to #131) (thanks to @Bing-su)
  • fix approximate_sq_grad() in the Adafactor optimizer, #132 (see the sketch below)
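
For context, here is a minimal sketch of the factored second-moment approximation that approximate_sq_grad() is responsible for, following the standard Adafactor formulation; the function signature and tensor names below are illustrative assumptions, not the library's exact internals.

```python
import torch


def approx_sq_grad(exp_avg_sq_row: torch.Tensor, exp_avg_sq_col: torch.Tensor) -> torch.Tensor:
    # Adafactor keeps per-row and per-column running averages of the squared
    # gradient and reconstructs 1 / sqrt(V_hat) from their outer product,
    # normalized by the mean of the row statistics.
    r_factor = (exp_avg_sq_row / exp_avg_sq_row.mean(dim=-1, keepdim=True)).rsqrt().unsqueeze(-1)
    c_factor = exp_avg_sq_col.unsqueeze(-2).rsqrt()
    return r_factor * c_factor  # broadcasts to the full (rows, cols) shape
```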

pytorch-optimizer v2.6.0

22 Apr 07:56
19dcf2b

Change Log

Feature

  • Implement SM3 optimizer, #130
  • Tweak the Scalable Shampoo optimizer, #128, #129 (a usage sketch follows this list)
    • implement a new preconditioner type, OUTPUT.
    • optimize speed/memory usage of coupled Newton iteration and power iteration methods.
    • use in-place operations (SQRT-N grafting).
    • clean up shampoo_utils for readability.
    • support the skip_preconditioning_rank_lt parameter to skip preconditioning for low-rank gradients.
    • set the default value of preconditioning_compute_steps to 1000.
    • set the default value of start_preconditioning_step to 25.
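
A rough usage sketch of the tweaked optimizer. Only the keyword names listed above come from this entry; the concrete values below (lr, skip_preconditioning_rank_lt=1) are illustrative assumptions, so check the documentation for the full, authoritative signature and defaults.

```python
import torch
from pytorch_optimizer import ScalableShampoo

model = torch.nn.Linear(128, 10)

optimizer = ScalableShampoo(
    model.parameters(),
    lr=1e-3,
    start_preconditioning_step=25,       # new default mentioned above
    preconditioning_compute_steps=1000,  # new default mentioned above
    skip_preconditioning_rank_lt=1,      # skip preconditioning for low-rank gradients
)
```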

pytorch-optimizer v2.5.2

11 Apr 13:47
e66435a

Feature

  • add eps to the Nero optimizer to stabilize optimization, #121 (usage sketch below)
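
A minimal usage sketch of the new eps argument, assuming the Nero optimizer is imported from the package as usual; the lr and eps values are illustrative, not the documented defaults.

```python
import torch
from pytorch_optimizer import Nero

model = torch.nn.Linear(64, 2)

# eps is added for numerical stability of the normalization terms;
# the value below is illustrative, not necessarily the library's default.
optimizer = Nero(model.parameters(), lr=0.01, eps=1e-8)
```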

Fix

  • fix Ranger21 so it does not skip updates when the first parameter has no gradient, #125, #126 (thanks to @jdb78)
  • fix Lookahead optimizer, #122, #123

Dependency

  • upgrade to PyTorch 2.0, #123

pytorch-optimizer v2.5.1

12 Mar 05:48
df9e78d

Change Log

Feature

Bug

pytorch-optimizer v2.5.0

15 Feb 05:41
26b8b19

pytorch-optimizer v2.4.2

10 Feb 10:57
fff34af

Change Log

Bug

  • Fix to deep-copy inverse preconditioners

Deps

  • Support PyTorch 2.0, #106 (related to #105)

Docs

pytorch-optimizer v2.4.1

06 Feb 06:34
06dce18

Change Log

Feature

  • Rename the new Shampoo optimizer to ScalableShampoo, #103
  • Implement the old version of the Shampoo optimizer, #103
  • Support an SVD method to calculate the inverse p-th root matrix, #103
    • to speed up the M^{-1/p} calculation, batched SVD is performed when available (see the sketch after this list).
  • Implement the AdamS optimizer, #102
  • Support the stable weight decay option for the Adai optimizer, #102
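
As a rough illustration of the SVD path, the sketch below computes M^{-1/p} for a symmetric PSD statistics matrix; the real compute_power_svd() may differ in its signature and in how small singular values are handled.

```python
import torch


def compute_power_svd(matrix: torch.Tensor, p: int, eps: float = 1e-16) -> torch.Tensor:
    # compute M^(-1/p) for a (batched) symmetric PSD matrix via SVD;
    # torch.linalg.svd handles leading batch dimensions, so stacked Kronecker
    # factors can be decomposed in a single batched call.
    u, s, vh = torch.linalg.svd(matrix)
    s = s.clamp_min(eps).pow(-1.0 / p)
    return u @ torch.diag_embed(s) @ vh
```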

Bug

  • Fix compute_power_svd() to get a singular value, #104

pytorch-optimizer v2.4.0

02 Feb 10:52
75a023a

Change Log

Feature

Improvement

  • refactor/improve matrix_power(); unroll the loop for performance, #101
  • speed up/fix power_iter() so it does not deep-copy mat_v, #101 (a power-iteration sketch follows this list)
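
For reference, a minimal sketch of power iteration for the dominant eigenvalue of a symmetric PSD matrix, reusing the matrix-vector product instead of deep-copying it; the library's power_iter() signature and convergence checks are not reproduced here.

```python
import torch


def power_iter(mat: torch.Tensor, num_iters: int = 100) -> torch.Tensor:
    # estimate the largest eigenvalue of a symmetric PSD matrix; mat_v is the
    # intermediate product that only needs to be rescaled, not copied.
    v = torch.randn(mat.shape[-1], dtype=mat.dtype, device=mat.device)
    v /= v.norm()
    for _ in range(num_iters):
        mat_v = mat @ v
        v = mat_v / mat_v.norm().clamp_min(1e-16)
    return v @ (mat @ v)  # Rayleigh quotient of a unit-norm vector
```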

Docs

  • D-Adaptation optimizers & Shampoo utils

pytorch-optimizer v2.3.1

31 Jan 13:20
44c423a

Change Log

Feature

  • more add-ons for Shampoo optimizer, #99
    • implement moving_average_for_momentum
    • implement decoupled_weight_decay
    • implement decoupled_learning_rate
    • support more grafting types (RMSProp, SQRT_N); see the grafting sketch after this list
    • support more PreConditioner types (ALL, INPUT)
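
Conceptually, grafting keeps the direction of the Shampoo update and borrows the step magnitude from the grafted optimizer. A hedged sketch of that idea (function and argument names are illustrative, not the library's API):

```python
import torch


def graft(shampoo_update: torch.Tensor, grafted_update: torch.Tensor, eps: float = 1e-16) -> torch.Tensor:
    # layer-wise grafting: keep the direction of the preconditioned (Shampoo)
    # update, but rescale it to the norm of the grafted optimizer's update
    # (e.g. SGD, AdaGrad, RMSProp, SQRT_N), so the effective step size follows
    # the grafted method.
    return shampoo_update * (grafted_update.norm() / (shampoo_update.norm() + eps))
```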

Docs

  • apply pydocstyle linter, #91

Refactor

  • deberta_v3_large_lr_scheduler, #91

ETC

  • add more Ruff rules (ICN, TID, ERA, RUF, YTT, PL), #91

pytorch-optimizer v2.3.0

30 Jan 07:42
5df1281

Change Log

Feature

  • re-implement Shampoo Optimizer (#97, related to #93)
    • layer-wise grafting (none, adagrad, sgd)
    • block partitioner (a conceptual sketch follows this list)
    • preconditioner
  • remove casting to fp16 or bf16 inside step() so as not to lose consistency with the other optimizers, #96
  • change some ops to in-place operations to speed things up, #96
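
As a conceptual sketch of what a block partitioner does (not the library's actual implementation): large gradients are split into fixed-size blocks so each block's preconditioners stay small.

```python
import torch


def partition_blocks(grad: torch.Tensor, block_size: int = 128) -> list:
    # split every dimension of the gradient into chunks of at most block_size,
    # so each block gets its own small Kronecker-factored preconditioner
    # instead of one huge factor per full dimension.
    blocks = [grad]
    for dim in range(grad.dim()):
        blocks = [chunk for block in blocks for chunk in torch.split(block, block_size, dim=dim)]
    return blocks
```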

Fix

  • fix exp_avg_var when amsgrad is True, #96 (see the sketch below)
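
For context, a minimal sketch of the AMSGrad-style bookkeeping this fix concerns; apart from exp_avg_var, the variable names and the update form are assumptions based on the usual AMSGrad recipe, not the optimizer's exact code.

```python
import torch

# when amsgrad=True, the denominator must be built from the running
# element-wise maximum of exp_avg_var, not from exp_avg_var directly.
exp_avg_var = torch.zeros(4)
max_exp_avg_var = torch.zeros(4)
grad = torch.randn(4)
beta2, eps = 0.999, 1e-8

exp_avg_var.mul_(beta2).addcmul_(grad, grad, value=1.0 - beta2)
max_exp_avg_var = torch.maximum(max_exp_avg_var, exp_avg_var)
denom = max_exp_avg_var.sqrt().add_(eps)
```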

Refactor

  • change linter from Pylint to Ruff, #97