Training VQVAE does not converge. #71

Open
henanjun opened this issue Nov 29, 2022 · 4 comments

Comments

@henanjun

This is the reconstruction result after 100 epochs of VQVAE training:
[image: recons_VQVAE_Epoch_99]
@blade-prayer

I found that in vq_vae.yaml, scheduler_gamma is set to 0.0. This parameter is the multiplicative factor passed to torch.optim.lr_scheduler.ExponentialLR, so the learning rate becomes 0 after epoch 0. Do you think this is the reason?
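
For reference, a minimal standalone sketch (not the repo's code; the choice of Adam and the dummy parameter are just for illustration) of how gamma=0.0 zeroes the learning rate after the first scheduler step:

```python
import torch

# With gamma=0.0, ExponentialLR multiplies the learning rate by 0 at its
# first step, so training effectively stops after epoch 0.
params = [torch.nn.Parameter(torch.zeros(1))]
optimizer = torch.optim.Adam(params, lr=0.005)  # LR value from vq_vae.yaml
scheduler = torch.optim.lr_scheduler.ExponentialLR(optimizer, gamma=0.0)

print(optimizer.param_groups[0]["lr"])  # 0.005 during epoch 0
optimizer.step()                        # a (dummy) optimizer step in epoch 0
scheduler.step()                        # end-of-epoch step: lr *= gamma
print(optimizer.param_groups[0]["lr"])  # 0.0 for every later epoch
```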

@imskull

imskull commented Dec 9, 2022

Changing "LR"(learning rate" from 0.005 to 0.001 helps.

@xjtupanda

Changing "LR"(learning rate" from 0.005 to 0.001 helps.

Met the same problem. I found the loss was always unreasonably high (~1.0e+6), which might cause a gradient explosion. This one helps, thanks a lot.

@ohhh-yang

In my training run the loss was even more unreasonable (up to 1.0e+26!) and just fluctuated like a pendulum. This one really works, you are my god!!!

Changing "LR"(learning rate" from 0.005 to 0.001 helps.
