-
Notifications
You must be signed in to change notification settings - Fork 190
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Inconsistencies in schedules API #835
Comments
I'm ready to pick this up! @fabianp |
excellent @Abhinavcode13 ! which task would you like to pick up? I would suggest to pick one of the remaining 3 and start with that. Smaller PR are better for everyone :-) |
Sure |
FYI: I would look up the second one first. |
I can take this up, If @Abhinavcode13 is busy. I have some questions:
The
While
|
transition_steps
parameter, but in some cases (e.g.,optax.cosine_decay_schedule
,optax.warmup_cosine_decay_schedule
but confusingly notoptax.cosine_onecycle_schedule
) it's calleddecay_steps
instead.sgdr_schedule
is not descriptive of what the schedule actually does.linear_onecycle_schedule
andcosine_onecycle_schedule
specify the length of the warm-up phrase using parameterpct_start
, butwarmup_cosine_decay_schedule
instead specifies it through a parameterwarmup_steps
In the documentation:
5. In the API reference https://optax.readthedocs.io/en/latest/api/optimizer_schedules.html there's a section "Schedules with warm-up". I would consider
optax.cosine_onecycle_schedule
to have warm-up, yet it's not in this section. My recommendation would be to remove the section ""Schedules with warm-up" and putoptax.warmup_cosine_decay_schedule
in the Cosine decay schedule section and optax.warmup_exponential_decay_schedule in the exponential decay sectionThe text was updated successfully, but these errors were encountered: