Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support for FiveCrop/TenCrop #72

Open
borijang opened this issue Sep 5, 2022 · 4 comments
Open

Support for FiveCrop/TenCrop #72

borijang opened this issue Sep 5, 2022 · 4 comments
Labels
help wanted Extra attention is needed idea for contribution An idea for contribution request for comments (RFC)

Comments

@borijang
Copy link
Contributor

borijang commented Sep 5, 2022

Hey, thanks for this repo.

I noticed a lot of architectures use five-cropped or ten-cropped features, so I am wondering if you considered/would consider implementing this feature.
Additionally, I am currently converting the frame rate of the videos before extracting the features, but I think it would be great if we could do both at once.

@v-iashin v-iashin added the idea for contribution An idea for contribution label Sep 5, 2022
@v-iashin
Copy link
Owner

v-iashin commented Sep 5, 2022

Yep, this would be nice to have. Do you mean the 5-crop and 10-crop features only for frame-wise feature extractors? Is there an equivalent for video?

Regarding the second part, among other reasons, I decided to re-encode it first to a file and then load it because loading the whole video with the original fps might be hurtful for the user's RAM. It is also the way how one would do it when pre-processing the dataset. Therefore, it is more convenient to do it separately for now.

A large number of corner cases when doing the conversion on the fly makes it unattractive for implementation within this library and I would rather rely on something popular and well-tested (e.g. how to make 17 fps video from 30 fps original).

We also currently discussing this and video IO in general in #61 You are welcome to participate in the discussion.

@borijang
Copy link
Contributor Author

borijang commented Sep 5, 2022

Thanks for the incredibly fast response!

Yep, specifically for me I'd use FiveCrop / TenCrop on the I3D model.

Regarding the fps conversion, I somehow missed the extraction_fps argument and utils/reencode_video_with_diff_fps, I think that's exactly what I wanted.

@v-iashin
Copy link
Owner

v-iashin commented Sep 5, 2022

Great! I will keep this opened.

If someone else would like to see FiveCrop / TenCrop implemented, let me know in this issue and I will take a look!

@v-iashin v-iashin changed the title Support for FiveCrop/TenCrop and frame rate conversion Support for FiveCrop/TenCrop Sep 5, 2022
@v-iashin v-iashin added the help wanted Extra attention is needed label May 2, 2024
@v-iashin
Copy link
Owner

v-iashin commented May 2, 2024

i don't have the resources to implement it myself but i'd love to consider a pull request implementing this (fixing or redoing the earlier attempt: #110).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
help wanted Extra attention is needed idea for contribution An idea for contribution request for comments (RFC)
Projects
None yet
Development

No branches or pull requests

2 participants