
[FEAT] Add option to support user defined learning rate scheduler for NeuralForecast Models #998

Merged · 7 commits merged into Nixtla:main on May 23, 2024

Conversation

@JQGoh (Contributor) commented May 9, 2024

Rationale


@JQGoh force-pushed the feat/user-defined-scheduler branch 3 times, most recently from e3de57f to 2a7d761 on May 9, 2024 18:35

JQGoh added commits (May 10, 2024 02:42):

  • Fix assertion check
  • Fix iterable issue
  • Fix parameter passed to user-defined lr_scheduler

@JQGoh force-pushed the feat/user-defined-scheduler branch from 2a7d761 to 17a895b on May 9, 2024 18:42
@JQGoh (Contributor, Author) commented May 9, 2024

@jmoralez Please review. I hope this will be useful to the community too.

@elephaint (Contributor)

@JQGoh Thanks for your work, the PR looks good to me. @cchallu @jmoralez I'm happy to add this as a feature to neuralforecast, wdyt?

@JQGoh (Contributor, Author) commented May 16, 2024

@jmoralez @cchallu If you have some time, I'd appreciate your review on this. Since new models have recently been added to neuralforecast, I would love to get this merged (subject to review/approval) so that other newly implemented models can also adopt these arguments.

@jmoralez (Member)

Can you please add warnings if the user provides lr_scheduler_kwargs but doesn't provide lr_scheduler (that the kwargs will be ignored)? We got a similar request for the optimizer by the sktime folks.

@JQGoh (Contributor, Author) commented May 16, 2024


@jmoralez
Sounds good to me. I shall add warnings for the following cases (see the sketch after this list):

  • the user provides lr_scheduler_kwargs but doesn't provide lr_scheduler
  • the user provides optimizer_kwargs but doesn't provide optimizer.

If you have the link/reference to the sktime folks' request, I can mention it in this PR too. Thanks.
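A minimal sketch of how such warnings might look (illustrative only; the argument names follow this discussion, and the helper function is hypothetical):

```python
import warnings

# Hypothetical helper called from a model __init__; only the warning logic is shown.
def _warn_on_ignored_kwargs(lr_scheduler, lr_scheduler_kwargs, optimizer, optimizer_kwargs):
    if lr_scheduler is None and lr_scheduler_kwargs is not None:
        warnings.warn(
            "lr_scheduler_kwargs was provided but lr_scheduler is None; the kwargs will be ignored."
        )
    if optimizer is None and optimizer_kwargs is not None:
        warnings.warn(
            "optimizer_kwargs was provided but optimizer is None; the kwargs will be ignored."
        )
```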

@jmoralez (Member)

sktime/sktime#6235 (comment)

@BrunoBelucci commented May 22, 2024

I was facing the same issue when I wanted to use a custom scheduler, and your solution is pretty much what I have. The only thing I am missing is that I think we should also be able to change the other arguments of the lr_scheduler_config, like the frequency or the interval (see https://lightning.ai/docs/pytorch/stable/common/lightning_module.html#configure-optimizers for the list of other parameters). My solution was in fact to add two new kwargs: lr_scheduler_kwargs, as you have, and lr_scheduler_config, as expected by Lightning, only adding the actual scheduler to it in configure_optimizers with lr_scheduler_config['scheduler'] = self.lr_scheduler(optimizer=optimizer, **lr_scheduler_kwargs). I think that if someone is bothering to manually change the lr_scheduler, it makes sense to give them full control over it.
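An illustrative sketch of the approach described above, assuming hypothetical self.lr_scheduler, self.lr_scheduler_kwargs, and self.lr_scheduler_config attributes on a LightningModule:

```python
import torch

# Hypothetical configure_optimizers sketch: accept both lr_scheduler_kwargs and a
# Lightning-style lr_scheduler_config, attaching the instantiated scheduler to the latter.
def configure_optimizers(self):
    optimizer = torch.optim.Adam(self.parameters(), lr=1e-3)
    if self.lr_scheduler is None:
        return optimizer
    # Start from the user-provided config, e.g. {"interval": "epoch", "frequency": 1,
    # "monitor": "val_loss"}, and add the actual scheduler under the "scheduler" key.
    lr_scheduler_config = dict(self.lr_scheduler_config or {})
    lr_scheduler_config["scheduler"] = self.lr_scheduler(
        optimizer=optimizer, **(self.lr_scheduler_kwargs or {})
    )
    return {"optimizer": optimizer, "lr_scheduler": lr_scheduler_config}
```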

@JQGoh (Contributor, Author) commented May 22, 2024


@BrunoBelucci
Thanks for your suggestion. I lean toward allowing users more freedom in customizing the behaviour, including the frequency.

On top of this PR, I think I may need an additional argument called lr_scheduler_config that represents the set of arguments detailed in https://lightning.ai/docs/pytorch/stable/common/lightning_module.html#configure-optimizers, but it shall ignore the "scheduler" key (since we will specify it separately, which also ensures that lr_scheduler uses the same optimizer).
If the Nixtla team agrees that lr_scheduler_config is a nice-to-have feature, I can prepare it in the next PR.
cc: @jmoralez

@jmoralez (Member)

I'm not really a fan of adding so many arguments to all models. Would it be possible to instead provide a function that overrides the configure_optimizers method, either by calling it or by monkey patching?

@BrunoBelucci
Honestly, I think that in this case, the cleanest solution is to add one more argument. We already have arguments for the trainer, optimizer, and scheduler. Introducing a different approach for configuring the scheduler, which is part of the same system, could confuse users and require them to write additional boilerplate code to achieve their goals. Here are some scenarios to further develop my point:

  1. Using the CosineAnnealingLR Scheduler
    The default scheduler configuration is fine because the scheduler is not affected by it.

  2. Using the OneCycleLR Scheduler
    The default scheduler configuration is fine if the user knows that the default is {'frequency': 1, 'interval': 'step'}. They would only need to calculate the number of steps to pass as total_steps in the lr_scheduler_kwargs. However, if they do not know this, they would need to check the code (since it is not documented) to ensure the model trains as intended.

  3. Using the ReduceLROnPlateau Scheduler
    In this case, the user needs to change the scheduler configuration because, typically, we want to identify a plateau by epoch rather than by step. Additionally, the user has to specify the "monitor" metric they want. Without the ability to pass the scheduler configuration directly, users would need to use one approach to pass the scheduler and its kwargs and another approach (such as calling another function or monkey patching) to overwrite the scheduler configuration (see the sketch after this comment).

I see no reason for users to have to handle each scenario above differently. Additionally, this approach would allow us to document the default behavior clearly.
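An illustrative sketch of scenario 3 under the proposal above. The lr_scheduler and lr_scheduler_kwargs arguments follow this PR's discussion; lr_scheduler_config is only a proposal at this point and the exact constructor details may differ:

```python
import torch
from neuralforecast.models import NHITS

# Hypothetical usage sketch for a ReduceLROnPlateau scheduler.
model = NHITS(
    h=12,
    input_size=24,
    max_steps=100,
    lr_scheduler=torch.optim.lr_scheduler.ReduceLROnPlateau,
    lr_scheduler_kwargs={"mode": "min", "factor": 0.5, "patience": 2},
    # Proposed (not implemented in this PR): let the user control the Lightning
    # scheduler config, e.g. step the scheduler per epoch instead of per step.
    # lr_scheduler_config={"interval": "epoch", "frequency": 1, "monitor": "train_loss"},
)
```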

@jmoralez (Member)

What I mean is that we currently have two arguments (for the optimizer), this PR adds two more (for the scheduler), and you're proposing a fifth, when all of these are used in the same method (configure_optimizers). They could all have been a single argument where the user passes a callable that takes the model parameters and returns what PyTorch Lightning expects from the configure_optimizers method.

Given that sktime is already using the optimizer arguments, it'd be a bad experience to deprecate them and introduce a new one that takes a function, so I think we should move forward with adding more rather than deprecating something we recently introduced. I just wish we'd done the single-argument approach from the start.
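A sketch of the single-argument alternative described above (purely illustrative; the configure_optimizers keyword shown in the comment is hypothetical and not part of the library):

```python
import torch

# Hypothetical single callable: takes the model parameters and returns what
# PyTorch Lightning expects from configure_optimizers.
def my_configure_optimizers(params):
    optimizer = torch.optim.AdamW(params, lr=1e-3)
    scheduler = torch.optim.lr_scheduler.ReduceLROnPlateau(optimizer, patience=2)
    return {
        "optimizer": optimizer,
        "lr_scheduler": {"scheduler": scheduler, "interval": "epoch", "monitor": "train_loss"},
    }

# The model would then call this from its own configure_optimizers, e.g.
# (hypothetical): NHITS(..., configure_optimizers=my_configure_optimizers)
```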

@jmoralez (Member) left a comment


Thanks for the thorough tests, as always!

@jmoralez merged commit 5fc342c into Nixtla:main on May 23, 2024. 14 checks passed.
@JQGoh (Contributor, Author) commented May 23, 2024


@jmoralez
I also wish I had implemented it differently; I did not consider the option of modifying the default configure_optimizers behavior.

By the way, shall we reconsider this implementation and revert this PR?

I implemented the option of modifying the configure_optimizers behavior via set_configure_optimizers at the BaseModel level; please check the work in #1015.
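A hypothetical usage sketch of the alternative proposed in #1015. The set_configure_optimizers name comes from the comment above, but its exact signature is an assumption; this only illustrates the idea of overriding configure_optimizers with a user callable:

```python
import torch
from neuralforecast.models import NHITS

# Hypothetical callable returning what Lightning's configure_optimizers expects.
def custom_configure_optimizers(model):
    optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
    scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=10, gamma=0.5)
    return {
        "optimizer": optimizer,
        "lr_scheduler": {"scheduler": scheduler, "interval": "step"},
    }

model = NHITS(h=12, input_size=24, max_steps=100)
# Hypothetical call; see #1015 for the actual API and signature.
model.set_configure_optimizers(custom_configure_optimizers)
```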
