How should the codes dimension be understood? #29

Open
jasonppy opened this issue Jun 26, 2023 · 0 comments

Comments

@jasonppy

Thank you for open-sourcing this great work!

I would like to confirm my understanding of the ordering of the output codes:
The output of the VQVAE encode function has shape [B, T, 4].
Suppose B=1 and T=2, and the codes are
[[a,b,c,d],
 [e,f,g,h]]

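My understanding:
a is the code from the first quantization of the first half of the feature at T=1,
b is the code from the first quantization of the second half of the feature at T=1,
c is the code obtained by quantizing the residual of a,
...

h is the code obtained by quantizing the residual of f.
Is this understanding correct?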
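
To make the ordering I have in mind concrete, here is a minimal sketch. It only encodes my assumption (2 feature halves × 2 residual quantization stages, with both halves of stage 1 stored before both halves of stage 2); it is not taken from the repo, and `describe_column`, `N_GROUPS`, and `N_STAGES` are hypothetical names I made up for illustration.

```python
N_GROUPS = 2  # assumed: feature split into a first half and a second half
N_STAGES = 2  # assumed: first quantization, then quantization of the residual


def describe_column(col, n_groups=N_GROUPS, n_stages=N_STAGES):
    """Map a column index of the last codes dimension to (group, stage).

    Encodes the hypothesized stage-major layout: all groups of stage 0
    come first, then all groups of stage 1. Not confirmed by the repo.
    """
    if not 0 <= col < n_groups * n_stages:
        raise ValueError(f"column {col} out of range")
    stage, group = divmod(col, n_groups)
    return group, stage


# The [T, 4] slice from the example above (B=1, T=2).
codes = [["a", "b", "c", "d"],   # time step 1
         ["e", "f", "g", "h"]]   # time step 2

for t, row in enumerate(codes, start=1):
    for col, code in enumerate(row):
        group, stage = describe_column(col)
        half = "first" if group == 0 else "second"
        # Under the assumption, a stage-2 code quantizes the residual left
        # by the stage-1 code of the same half, N_GROUPS columns earlier.
        source = "feature" if stage == 0 else f"residual of {row[col - N_GROUPS]}"
        print(f"{code}: T={t}, {half} half, stage {stage + 1}, input = {source}")
```

If the actual layout is group-major instead (both stages of the first half, then both stages of the second half), `divmod(col, n_stages)` with the outputs swapped would describe it, and c would then belong to the second half rather than being the residual code of a.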

Thanks,
Puyuan
