Fix decoding issue with PYAV due to new support for multiple training… #541

dfan · 2022-05-10T23:04:38Z

See issue #181. When using pyav as the decoding backend, decoding fails with the exception: unsupported operand type(s) for -: 'list' and 'int'. This happens because num_frames is now a list. torchvision_decode() already handles the list correctly, but pyav_decode() does not.
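The error can be reproduced without decoding any video. The following is a minimal sketch, assuming num_frames arrives as a list with one entry per training view; `clip_size_for` is a hypothetical helper rather than a SlowFast function, and its arithmetic is only illustrative.

```python
def clip_size_for(sampling_rate, num_frames, target_fps, fps):
    """Return one clip size per entry of num_frames.

    Hypothetical helper: accepts either a single int or a list of ints,
    so callers written for the old scalar form keep working.
    """
    if isinstance(num_frames, int):
        num_frames = [num_frames]
    # Apply the arithmetic element by element instead of to the list itself.
    return [sampling_rate * n / target_fps * fps for n in num_frames]


num_frames = [8, 32]

# Scalar-style arithmetic on the list raises the TypeError from the report.
try:
    num_frames - 1
except TypeError as exc:
    message = str(exc)

sizes = clip_size_for(sampling_rate=2, num_frames=num_frames, target_fps=30, fps=30)
```

Normalizing to a list at the top of the function is one way to keep a single code path for both the old scalar and the new list form.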

… views

facebook-github-bot · 2022-05-10T23:04:42Z

Hi @dfan!

Thank you for your pull request and welcome to our community.

Action Required

In order to merge any pull request (code, docs, etc.), we require contributors to sign our Contributor License Agreement, and we don't seem to have one on file for you.

Process

In order for us to review and merge your suggested changes, please sign at https://code.facebook.com/cla. If you are contributing on behalf of someone else (eg your employer), the individual CLA may not be sufficient and your employer may need to sign the corporate CLA.

Once the CLA is signed, our tooling will perform checks and validations. Afterwards, the pull request will be tagged with CLA signed. The tagging process may take up to 1 hour after signing. Please give it that time before contacting us about it.

If you have received this in error or have any questions, please contact us at cla@fb.com. Thanks!

facebook-github-bot · 2022-05-22T20:16:32Z

Thank you for signing our Contributor License Agreement. We can now accept your code for this (and any) Meta Open Source project. Thanks!

facebook-github-bot · 2022-05-22T21:00:39Z

Thank you for signing our Contributor License Agreement. We can now accept your code for this (and any) Meta Open Source project. Thanks!

facebook-github-bot · 2022-06-10T16:41:59Z

@lyttonhao has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

XinyuSun · 2022-06-10T23:46:50Z

Hi! There is an error in the code:
Lines 442 and 443 should be:
442: video_start_pts = int(start_idx * timebase)
443: video_end_pts = int(end_idx * timebase)
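The two corrected lines convert frame indices into presentation timestamps (pts). The sketch below illustrates that conversion under one assumption: timebase is the stream duration in pts units divided by the number of frames. `frame_range_to_pts` and its parameters are hypothetical names used only for illustration.

```python
def frame_range_to_pts(start_idx, end_idx, duration_pts, total_frames):
    """Convert a [start_idx, end_idx] frame range to integer pts values.

    Assumption: pts advance uniformly, so one frame spans
    duration_pts / total_frames pts units (the "timebase" here).
    """
    timebase = duration_pts / total_frames
    video_start_pts = int(start_idx * timebase)
    video_end_pts = int(end_idx * timebase)
    return video_start_pts, video_end_pts


# A 300-frame stream whose duration is 153600 pts units: 512 pts per frame.
start_pts, end_pts = frame_range_to_pts(30, 94, duration_pts=153600, total_frames=300)
```

The `int(...)` casts mirror the suggested fix, so both bounds are integers even though timebase itself is a float.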

facebook-github-bot · 2022-06-11T20:21:45Z

@lyttonhao has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

dfan · 2022-06-12T03:22:52Z

Very sorry ... yes you are right. With this additional fix I am now able to achieve correct results with the new code. Thanks and sorry again!

lyttonhao · 2022-06-12T04:02:49Z

> Very sorry ... yes you are right. With this additional fix I am now able to achieve correct results with the new code. Thanks and sorry again!

Do you want to update the code? Thanks!

junchen14 · 2022-06-12T07:48:05Z

> See issue #181. When using pyav as a backend, we get the exception pyav with exception: unsupported operand type(s) for -: 'list' and 'int'. This is because num_frames is now a list. torchvision_decode() is correct but not pyav_decode().

This solves the problems that I faced.

StawEndl · 2022-06-13T10:21:10Z

> Very sorry ... yes you are right. With this additional fix I am now able to achieve correct results with the new code. Thanks and sorry again!

Hi,
I use the pyav_decode function to read the video, but it returns decode_all_video as False. When decode_all_video is False, start_end_delta_time is not created.
See #557.
How can I fix it?

XinyuSun · 2022-06-13T11:59:50Z

pyav is much slower than the other decoding backends. I'm considering adding support for the decord backend.

facebook-github-bot · 2022-06-14T17:39:23Z

@lyttonhao has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2022-06-16T03:09:02Z

@dfan has updated the pull request. You must reimport the pull request before landing.

facebook-github-bot · 2022-06-16T06:12:56Z

@lyttonhao has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

lyttonhao

Looks great! We should also handle the case when duration is None

lyttonhao · 2022-06-16T06:22:36Z

slowfast/datasets/decoder.py

@@ -414,35 +418,49 @@ def pyav_decode(
    else:


For lines 414-417, we should also decode the whole video and return the frames. Example code could be:

    decode_all_video = True
    video_start_pts, video_end_pts = 0, math.inf
    start_end_delta_time = None
    frames = None
    if container.streams.video:
        video_frames, max_pts = pyav_decode_stream(
            container,
            video_start_pts,
            video_end_pts,
            container.streams.video[0],
            {"video": 0},
        )
        frames = [frame.to_rgb().to_ndarray() for frame in video_frames]
        frames = torch.as_tensor(np.stack(frames))
    frames_out = [frames]
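The decision the reviewer describes (seek to a sub-range when the duration is known, otherwise decode everything) can be isolated from the PyAV calls. This is a rough sketch only: `choose_decode_window` is a hypothetical helper, and the uniform timebase is an assumption rather than the project's exact logic.

```python
import math


def choose_decode_window(duration, start_idx, end_idx, total_frames):
    """Pick the pts window to decode and report whether it is the whole video.

    Hypothetical helper: duration is the stream duration in pts units,
    or None when the container does not report one.
    """
    if duration is None:
        # No metadata to seek with: fall back to decoding everything.
        return 0, math.inf, True
    timebase = duration / total_frames
    return int(start_idx * timebase), int(end_idx * timebase), False
```

Returning the decode-all flag alongside the window lets the caller know that clip sampling still has to happen after decoding.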

facebook-github-bot · 2022-06-23T00:56:01Z

@dfan has updated the pull request. You must reimport the pull request before landing.

dfan · 2022-06-23T00:56:59Z

Good catch! I saw there were additional fixes to the decoder.py in 6069ea9 - do you want me to include those changes or will you merge those in?

lyttonhao · 2022-06-29T03:38:33Z

> Good catch! I saw there were additional fixes to the decoder.py in 6069ea9 - do you want me to include those changes or will you merge those in?

Yeah, that would be great if you could also include those changes!

facebook-github-bot · 2022-07-23T07:47:30Z

@dfan has updated the pull request. You must reimport the pull request before landing.

dfan · 2022-07-23T07:49:42Z

Apologies for the delay

cir7 · 2022-10-12T08:18:04Z

@dfan Hi, I am curious why the use_offset parameter was removed from the call to get_multiple_start_end_idx on line 456.

M-Masuhara · 2022-10-19T03:38:48Z

Hi. I started using SlowFast last week and I get the same error as you.

As for DECODING_BACKEND, I have tried pyav and torchvision, but both show the following error.

pyav:
Failed to decode by pyav with exception: unsupported operand type(s) for -: 'list' and 'int'
Failed to decode video idx 63 from C:/ ~ video path ; trial 99

torchvision:
Failed to decode by torchvision with exception: 'OpNamespace' object has no attribute 'probe_video_from_memory'
Failed to decode video idx 63 from C:/ ~ video path ; trial 99

As for slowfast>datasets>decoder.py, it has already been corrected as described here.

The library versions are as follows:

PyTorch = 1.12.1
torchvision = 0.13.1
ffmpeg = 1.4
pyav = 9.2.0

What is wrong?
I would like to solve this problem somehow with pyav.
Please help.
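For reports like this one, listing the exact installed versions programmatically avoids transcription mistakes. A small sketch using only the standard library; the distribution names "torch", "torchvision", and "av" are assumed to be the names under which the packages were installed.

```python
from importlib import metadata


def installed_versions(packages=("torch", "torchvision", "av")):
    """Map each distribution name to its installed version, or None if absent."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


for name, version in installed_versions().items():
    print(f"{name}: {version or 'not installed'}")
```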

zsz00 · 2022-12-21T04:32:15Z

I am running into the memory leak issue too.

Fix decoding issue with PYAV due to new support for multiple training…

6808e41

… views

dfan mentioned this pull request May 10, 2022

Torchvision backend not working. #181

Open

sunrise1000 approved these changes May 16, 2022


facebook-github-bot added the CLA Signed label May 22, 2022

dfan mentioned this pull request May 27, 2022

New Commits in Repo Break Previous Code #544

Closed

Fix start end pts in decoder

379533a

lyttonhao suggested changes Jun 16, 2022


Add condition for decode whole video in pyav_decode

bc673f9

lyttonhao mentioned this pull request Jul 6, 2022

PyAV decoding backend no longer works #563

Closed

alpargun mentioned this pull request Jul 14, 2022

RuntimeError: Failed to fetch video idx 168596 from /data/k400/train/salsa_dancing/EY6MSW3zkr8_000048_000058.avi; after 99 trials #558

Open

Merge recent changes to decoder.py

e0d247a

JerryYLi pushed a commit to JerryYLi/SlowFast that referenced this pull request Jul 23, 2022

Merge pull request facebookresearch#541

b6a45f1

wnzhyee mentioned this pull request Nov 24, 2022

pyav decode leads memory leaks issue when 'duration' is none #626

Open

bihanikeshav approved these changes Jan 8, 2023


haritha91 added a commit to haritha91/SlowFast that referenced this pull request Jan 13, 2023

add decoder from facebookresearch#541

4a3edb4
