
[Feature] nn.LazyLinear #1484
Open · holdjun opened this issue Feb 2, 2024 · 2 comments
Labels: bug (Something isn't working)

Comments


holdjun commented Feb 2, 2024

What is the feature?

Using nn.LazyLinear raises an error in BaseModule's _dump_init_info. The root cause is that _dump_init_info records the model's weight values in the initialization info, but a lazy module's weights are not materialized until the first forward pass, so reading them before that fails.
It would be great if nn.LazyLinear could be supported.
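
A minimal repro sketch (hedged: the `Net` class is illustrative, and the exact traceback may vary by mmengine version):

```python
import torch
import torch.nn as nn
from mmengine.model import BaseModule

class Net(BaseModule):  # illustrative model, not from the codebase
    def __init__(self):
        super().__init__()
        # out_features is fixed; in_features is inferred on the first forward.
        self.fc = nn.LazyLinear(out_features=8)

    def forward(self, x):
        return self.fc(x)

net = Net()
# init_weights() triggers _dump_init_info, which reads every parameter's
# data before any forward pass has run; at this point LazyLinear's weight
# is still an UninitializedParameter, so the dump errors out.
net.init_weights()
```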

Any other context?

https://pytorch.org/docs/stable/generated/torch.nn.modules.lazy.LazyModuleMixin.html#torch.nn.modules.lazy.LazyModuleMixin
https://pytorch.org/docs/stable/generated/torch.nn.LazyLinear.html#torch.nn.LazyLinear

zhouzaida added the bug label on Feb 4, 2024
zhouzaida (Member) commented

Hi @holdjun, thanks for your feedback. We will fix it ASAP.

zhouzaida (Member) commented Feb 7, 2024

Hi @holdjun, if you use lazy modules, what behavior would you expect (skip the uninitialized parameters, or some other action) when loading checkpoints?
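
For context, a hedged sketch of what a "skip" policy could look like, using PyTorch's public UninitializedParameter type to detect parameters that have not been materialized yet (the helper name `initialized_named_params` is hypothetical, not mmengine API):

```python
import torch
import torch.nn as nn

def initialized_named_params(module: nn.Module):
    """Yield only parameters that have been materialized (hypothetical helper)."""
    for name, param in module.named_parameters():
        # Lazy modules expose their params as UninitializedParameter until
        # the first forward pass infers the shapes.
        if isinstance(param, nn.parameter.UninitializedParameter):
            continue
        yield name, param

fc = nn.LazyLinear(out_features=4)
print([n for n, _ in initialized_named_params(fc)])  # [] before any forward
fc(torch.randn(2, 3))  # first forward materializes weight and bias
print([n for n, _ in initialized_named_params(fc)])  # ['weight', 'bias']
```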
