Image Resolution 112*112 #272

darshvirbelandis · 2022-09-08T05:54:39Z

I wanted to be able to input larger image resolutions. However when I do input image size of 480*480 it takes almost 10 minutes to process a tiny 10 second clip.

It seems when I increase image size, the model inference run-time become exponentially greater.

There is crucial motion information being lost when I downscale my images to 112*112 and it is effecting the precision of the model on my test sets.

Is there any alternative model or method that will allow me to proceed with larger image resolutions using the 3D-ResNet model?

Is it practical to use 3D-CNN with input sizes of 480*480 images for video classification tasks?

87003697 · 2022-09-08T05:55:02Z

这是来自QQ邮箱的假期自动回复邮件。您好，我最近正在休假中，无法亲自回复您的邮件。我将在假期结束后，尽快给您回复。

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Image Resolution 112*112 #272

Image Resolution 112*112 #272

darshvirbelandis commented Sep 8, 2022 •

edited

87003697 commented Sep 8, 2022 via email

Image Resolution 112*112 #272

Image Resolution 112*112 #272

Comments

darshvirbelandis commented Sep 8, 2022 • edited

87003697 commented Sep 8, 2022 via email

darshvirbelandis commented Sep 8, 2022 •

edited