Hello,
The formula in the documentation for `cosine_decay_schedule` (https://optax.readthedocs.io/en/latest/api/optimizer_schedules.html#optax.cosine_decay_schedule) would suggest that the learning rate increases again after T steps.

A quick look at the code confirms this is not the case, but it may be good to state it explicitly, as is done for `linear_schedule`.

Happy to make a short PR! I could also propose a short formula/pseudocode for functions like `piecewise_constant_schedule` that do not have one.
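For concreteness, here is a minimal sketch of the clamped behavior, mirroring what the code appears to do (an illustration, not the exact library source):

```python
import jax.numpy as jnp

def cosine_decay(init_value, decay_steps, alpha=0.0):
    """Sketch of a cosine decay schedule that stays flat after decay_steps."""
    def schedule(count):
        # Clamp the step count so the cosine argument never exceeds pi:
        # for count >= decay_steps the rate stays at alpha * init_value.
        count = jnp.minimum(count, decay_steps)
        cosine = 0.5 * (1.0 + jnp.cos(jnp.pi * count / decay_steps))
        return init_value * ((1.0 - alpha) * cosine + alpha)
    return schedule
```

Evaluated at `count >= decay_steps`, this returns `alpha * init_value` and never rises again, whereas a literal reading of the docs formula would suggest the cosine wraps around.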
Best
GJ
Hello @gjhuizing,
Thanks for catching this! If you are willing to make such a PR, that would be great!
Great!