
Has anyone successfully trained this model? #34

Open
Jihun999 opened this issue Feb 16, 2024 · 13 comments

Comments

@Jihun999

I have tried to train this model for a few days, but the reconstruction results are always abnormal. If anyone has trained this model successfully, could you share some training tips?

Jihun999 closed this as not planned Feb 16, 2024
Jihun999 reopened this Feb 16, 2024
@Jihun999
Author

The reconstructed images look like solid-color images.

@RobertLuo1

RobertLuo1 commented Feb 20, 2024

Can you show the reconstruction images after training?

@Jihun999
Author

[example image attached] It always looks like this.

@RobertLuo1

@bridenmj How many epochs did you use? Are you working on the ImageNet pretraining?

@Jihun999
Author

Yes, I'm working on ImageNet pretraining; it has passed 12,000 steps and the output image always looks the same. I tried LFQ in my own autoencoder and that training works well, so it looks like there is something wrong in the magvit2 model architecture.
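For context, LFQ here is the lookup-free quantization from the MAGVIT-v2 paper: each latent channel is binarized to ±1 and the resulting sign pattern doubles as the code index, so no codebook lookup is needed. A minimal sketch of that idea (not this repository's implementation; the class name, shapes, and arguments are illustrative):

```python
import torch
import torch.nn as nn

class SimpleLFQ(nn.Module):
    """Minimal lookup-free quantization sketch: each latent channel is
    binarized to {-1, +1}, and the sign pattern doubles as the code index.
    A straight-through estimator lets gradients reach the encoder."""

    def __init__(self, dim: int):
        super().__init__()
        self.dim = dim  # implicit codebook size is 2 ** dim

    def forward(self, z: torch.Tensor):
        # z: (batch, dim, ...) continuous encoder output
        ones = torch.ones_like(z)
        q = torch.where(z > 0, ones, -ones)   # hard binarization to +/-1
        q = z + (q - z).detach()              # straight-through estimator

        # interpret the sign pattern at each position as a binary code index
        bits = (z > 0).long().movedim(1, -1)                   # (..., dim)
        powers = 2 ** torch.arange(self.dim, device=z.device)  # (dim,)
        indices = (bits * powers).sum(dim=-1)
        return q, indices
```

The real implementations also add commitment and entropy terms on top of this bare quantizer.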

@RobertLuo1

Actually, I reimplemented the model structure to align with the MAGVIT-v2 paper, but I find that the LFQ loss is negative and the reconstruction loss converges easily with or without the GAN. The reconstructed images are blurry but not a solid color. What about you? @Jihun999
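(A negative LFQ loss is not necessarily a bug by itself: per the MAGVIT-v2 paper, the LFQ auxiliary objective includes an entropy penalty of roughly the form below, and the subtracted codebook-usage entropy term can outweigh the per-sample entropy term. This is a reading of the paper, not of this repository's code.)

```latex
% Schematic LFQ entropy penalty (MAGVIT-v2): minimize per-sample entropy,
% maximize batch-average (codebook-usage) entropy; the difference can be negative.
\mathcal{L}_{\text{entropy}} = \mathbb{E}\big[H(q(z))\big] - H\big(\mathbb{E}[q(z)]\big)
```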

@Jihun999
Author

Ok, I will reimplement the model first. Thank you for your comment.

@Jason3900

> Actually, I reimplemented the model structure to align with the MAGVIT-v2 paper, but I find that the LFQ loss is negative and the reconstruction loss converges easily with or without the GAN. The reconstructed images are blurry but not a solid color. What about you? @Jihun999

Hey, would it be possible to share the code modifications for the model architecture alignment? Thanks a lot!

@lucidrains
Owner

Someone I know has trained it successfully.

@Jiushanhuadao

Wow, could I know who did it?

@StarCycle

@RobertLuo1 @Jihun999 @lucidrains If you successfully trained this model, would you like to share the pretrained weights and the modified model code?

@vinyesm

vinyesm commented May 16, 2024

Hello there,
Thanks @lucidrains for your work! I have had successful trainings on toy data (tried it on both images and video) with the code in this fork https://github.com/vinyesm/magvit2-pytorch/blob/trainvideo/examples/train-on-video.py and with this video data https://huggingface.co/datasets/mavi88/phys101_frames/tree/main. What seemed to fix the issue was to stop using accelerate (I only train on one GPU).

I tried with only MSE and then also with the other losses, and with/without attend_space layers. All work, but I did not try to tune hyperparameters.

[Two screenshots of reconstruction results attached, dated 2024-05-16]
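In case it helps anyone reproduce this setup, here is a rough single-GPU sketch with no accelerate and only the reconstruction objective, in the spirit of the script linked above. The VideoTokenizer constructor arguments and the return convention of the loss-returning forward pass are assumptions on my part; the linked train-on-video.py is the working reference.

```python
# Rough single-GPU sketch (no `accelerate`), reconstruction loss only; the
# discriminator / adversarial phase is ignored. Constructor arguments and the
# `return_loss` return convention are assumptions; see the linked fork for
# the script that is known to work.
import torch
from torch.utils.data import DataLoader, TensorDataset
from magvit2_pytorch import VideoTokenizer

device = 'cuda' if torch.cuda.is_available() else 'cpu'

# tiny synthetic stand-in dataset: 8 random "videos" of shape (channels, frames, height, width)
videos = torch.randn(8, 3, 17, 128, 128)
loader = DataLoader(TensorDataset(videos), batch_size = 2, shuffle = True)

tokenizer = VideoTokenizer(
    codebook_size = 2 ** 13,   # illustrative, not tuned
).to(device)

optimizer = torch.optim.Adam(tokenizer.parameters(), lr = 1e-4)

for epoch in range(10):
    for (batch,) in loader:
        batch = batch.to(device)

        out = tokenizer(batch, return_loss = True)
        loss = out[0] if isinstance(out, tuple) else out  # handle either return convention

        loss.backward()
        optimizer.step()
        optimizer.zero_grad()
```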

@lucidrains
Owner

thank you for sharing this, Marina! I'll see if I can find the bug, and if worse comes to worst, I can always rewrite the training code in pytorch lightning
