The results for CausalConv3d #11

Epiphqny · 2023-11-16T02:02:23Z

Hi @lucidrains, thanks for your awesome work! I used your causal conv implementation to train a video VQGAN network. The results are as follows:
Original clip sequence:

The reconstructed clip sequence:

I've noticed that the reconstruction seems to rely heavily on the initial frame. As the sequence progresses, the frames lose clarity, with each subsequent frame blurrier than the one before. Could you provide any insights into this phenomenon? Thank you for your time and assistance!

lucidrains · 2023-11-16T15:33:09Z

@Epiphqny wow Yuqing! those results do not look half bad! i'll have to think about your results a bit more. so this work builds upon the cvivit from the phenaki paper. in that paper, i believe they encode the first frame separately from the rest (to allow for single image pretraining). however, in this work, they decide to just pad on the left and use the same encoding for the first frame vs the rest. perhaps i can add the cvivit way for the sake of comparing the two
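To make the two options in the comment above concrete, here is a minimal, dependency-free sketch reduced to the time axis only (one number per frame). It is not the repository's `CausalConv3d`; the function names, the zero `pad_value`, and the toy "separate first frame" variant are assumptions made for illustration. The first function pads only on the left in time and applies the same kernel to every frame; the second loosely mirrors the idea of encoding the first frame with its own weights.

```python
def causal_conv_time(frames, kernel, pad_value=0.0):
    """Causal convolution along time with shared weights.

    Left-pads the sequence with len(kernel) - 1 values, so the output has
    the same length as the input and output[t] depends only on frames[:t + 1].
    """
    k = len(kernel)
    padded = [pad_value] * (k - 1) + list(frames)
    return [
        sum(w * padded[t + i] for i, w in enumerate(kernel))
        for t in range(len(frames))
    ]


def encode_first_frame_separately(frames, first_weight, kernel):
    """Toy analogue of treating the first frame on its own (hypothetical).

    Frame 0 gets a dedicated weight; the remaining frames go through the
    causal convolution without seeing frame 0.
    """
    head = [first_weight * frames[0]]
    tail = causal_conv_time(frames[1:], kernel)
    return head + tail


# Small demo: four "frames", temporal kernel of size 3.
demo_frames = [1.0, 2.0, 3.0, 4.0]
demo_kernel = [0.25, 0.25, 0.5]
print(causal_conv_time(demo_frames, demo_kernel))
print(encode_first_frame_separately(demo_frames, 2.0, demo_kernel))
```

With left-only padding, the first output sees only padding plus frame 0, while later outputs see real history, so the first frame is processed under different input statistics even though the weights are shared.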

lucidrains · 2023-11-16T15:49:09Z

@Epiphqny once i circle back to this, also want to craft out a few more specialized discriminators (fourier domain as well as temporal)

lucidrains · 2023-11-16T16:00:04Z

@Epiphqny did you use LFQ or FSQ btw? could you share your hyperparameters?

lucidrains · 2023-11-16T17:31:45Z

@Epiphqny added it here if you want to run some experiments

Epiphqny · 2023-11-17T01:36:29Z

Hi @lucidrains, thanks for your prompt response! Actually, I didn't use LFQ or FSQ. Instead, I used the quantization from CVQ-VAE (https://github.com/lyndonzheng/CVQ-VAE) and extended the 2D convs to 3D causal convs like magvit2. For the training parameters, I followed the setup used in VQGAN and initialized the weights from a CVQ-VAE model pretrained on image data. I will train with the updated first-frame code, and I'm looking forward to the updated discriminator!
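The comment does not say how the image-pretrained 2D weights were loaded into the 3D causal convs. One common option, shown here purely as an assumption and not as what was done in this experiment, is to place the 2D kernel in the last temporal slice and zero the rest, so that at initialization the causal 3D conv reproduces the 2D conv frame by frame. The sketch below uses nested lists and hypothetical helper names so that it runs without dependencies.

```python
def inflate_2d_kernel(kernel_2d, time_size):
    """Build a [time_size][kh][kw] kernel: zeros except the last temporal slice."""
    kh, kw = len(kernel_2d), len(kernel_2d[0])
    inflated = [[[0.0] * kw for _ in range(kh)] for _ in range(time_size - 1)]
    inflated.append([row[:] for row in kernel_2d])  # current-frame slice
    return inflated


def conv2d_at(frame, kernel_2d, y, x):
    """2D cross-correlation response with the kernel's top-left corner at (y, x)."""
    return sum(
        kernel_2d[i][j] * frame[y + i][x + j]
        for i in range(len(kernel_2d))
        for j in range(len(kernel_2d[0]))
    )


def causal_conv3d_at(frames, kernel_3d, t, y, x):
    """Causal 3D response at time t, with zero padding on the left in time."""
    kt = len(kernel_3d)
    total = 0.0
    for dt in range(kt):
        src = t - (kt - 1) + dt  # the last slice (dt = kt - 1) reads frame t
        if src < 0:
            continue  # zero padding contributes nothing
        total += conv2d_at(frames[src], kernel_3d[dt], y, x)
    return total


# Small demo: two 3x3 frames, a 2x2 image kernel inflated to 3 time steps.
demo_frames = [
    [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0], [7.0, 8.0, 9.0]],
    [[9.0, 8.0, 7.0], [6.0, 5.0, 4.0], [3.0, 2.0, 1.0]],
]
demo_kernel_2d = [[1.0, -1.0], [0.5, 2.0]]
demo_kernel_3d = inflate_2d_kernel(demo_kernel_2d, 3)
print(causal_conv3d_at(demo_frames, demo_kernel_3d, 1, 0, 0))
print(conv2d_at(demo_frames[1], demo_kernel_2d, 0, 0))
```

Under this initialization the temporal slices for past frames start at zero, so any use of temporal context has to be learned during video training.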

lucidrains · 2023-11-17T02:32:16Z

@Epiphqny ohh i see! i didn't know you only used the causal conv

i'm not sure what the issue is then

Epiphqny · 2023-11-18T03:12:19Z

@lucidrains Thanks for your response! I will try more modules in this implementation and update the results later.

sijeh · 2024-03-11T06:28:49Z

> @lucidrains Thanks for your response! I will try more modules in this implementation and update the results later.

Hi @Epiphqny, is there any progress on improving the results?
