Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

question about encode #346

Closed
MichalTurek opened this issue Dec 19, 2023 · 3 comments
Closed

question about encode #346

MichalTurek opened this issue Dec 19, 2023 · 3 comments

Comments

@MichalTurek
Copy link

MichalTurek commented Dec 19, 2023

Hi, could you please describe what exactly is done and what's the output of encode in IFNet.forward()

 f0 = self.encode(img0[:, :3])
  f1 = self.encode(img1[:, :3])

It outputs tensor of size (1,8,width,height) but what exactly this tensor represents?

@hzwer
Copy link
Owner

hzwer commented Dec 20, 2023

My detailed ideas can be seen in https://arxiv.org/abs/2310.17294.
Warping features is better than only warping images, refer to Context-aware synthesis for video frame interpolation
It is an encoder learned by the model independently. I don't know how to explain what exactly it does.

@MichalTurek
Copy link
Author

okay, thank you. Another thing, is there any paper which further explain how mask is estimated? I am talking about mask which is used for forward and backward warped images merge into final output?

@hzwer
Copy link
Owner

hzwer commented Dec 20, 2023

You may refer to Superslomo

@hzwer hzwer closed this as completed May 16, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants