Integrate Audiovisual SlowFast Networks into the repo #219

fanyix · 2020-06-14T06:47:32Z

Please see projects/avslowfast/README.md for a starting point for training and evaluating an AVSlowFast 4x16 R50 model.

facebook-github-bot · 2020-06-14T06:47:48Z

Thank you for your pull request. We require contributors to sign our Contributor License Agreement, and yours needs attention.

You currently have a record in our system, but we do not have a signature on file.

In order for us to review and merge your code, please sign at https://code.facebook.com/cla. If you are contributing on behalf of someone else (eg your employer), the individual CLA may not be sufficient and your employer may need to sign the corporate CLA.

If you have received this in error or have any questions, please contact us at cla@fb.com. Thanks!

facebook-github-bot

@takatosp1 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot · 2020-06-15T17:22:36Z

Thank you for signing our Contributor License Agreement. We can now accept your code for this (and any) Facebook open source project. Thanks!

facebook-github-bot · 2020-08-11T20:15:45Z

@fanyix has updated the pull request. You must reimport the pull request before landing.

facebook-github-bot · 2020-08-11T20:41:24Z

@fanyix has updated the pull request. You must reimport the pull request before landing.

marteautony · 2020-10-02T06:19:12Z

Hi,

There is a problem with the video visualization demo. No audio input is provided, so the following error is raised:

Traceback (most recent call last):
  File "tools/run_net.py", line 42, in <module>
    main()
  File "tools/run_net.py", line 37, in main
    demo(cfg)
  File "/workspace/Documents/Tony/AVSlowFast/tools/demo_net.py", line 119, in demo
    for frames in tqdm.tqdm(run_demo(cfg, frame_provider)):
  File "/usr/local/lib/python3.6/dist-packages/tqdm/std.py", line 1174, in __iter__
    for obj in iterable:
  File "/workspace/Documents/Tony/AVSlowFast/tools/demo_net.py", line 79, in run_demo
    model.put(task)
  File "/workspace/Documents/Tony/AVSlowFast/slowfast/visualization/predictor.py", line 138, in put
    task = self.predictor(task)
  File "/workspace/Documents/Tony/AVSlowFast/slowfast/visualization/predictor.py", line 77, in __call__
    inputs = process_cv2_inputs(frames, self.cfg)
  File "/workspace/Documents/Tony/AVSlowFast/slowfast/visualization/utils.py", line 321, in process_cv2_inputs
    inputs = [inp.unsqueeze(0) for inp in inputs]
  File "/workspace/Documents/Tony/AVSlowFast/slowfast/visualization/utils.py", line 321, in <listcomp>
    inputs = [inp.unsqueeze(0) for inp in inputs]
AttributeError: 'NoneType' object has no attribute 'unsqueeze'
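
The failing line calls `unsqueeze(0)` on every pathway input, so a single missing pathway surfaces as an opaque `AttributeError`. The following is a minimal sketch of a guard that reports which pathway is missing. It is not the repository's code: the helper name `add_batch_dim`, the pathway order ("slow", "fast", "audio"), and the `FakeTensor` stand-in (used only so the sketch runs without PyTorch) are all assumptions made for illustration.

```python
def add_batch_dim(inputs, pathway_names=("slow", "fast", "audio")):
    """Add a leading batch dimension to every pathway input.

    Raises a descriptive ValueError, instead of an AttributeError, when a
    pathway input is None. The pathway names are hypothetical labels.
    """
    missing = [
        pathway_names[i] if i < len(pathway_names) else f"pathway {i}"
        for i, inp in enumerate(inputs)
        if inp is None
    ]
    if missing:
        raise ValueError(
            "No input for pathway(s): " + ", ".join(missing)
            + ". The demo must supply every pathway the model expects."
        )
    return [inp.unsqueeze(0) for inp in inputs]


class FakeTensor:
    """Stand-in for torch.Tensor so this sketch runs without PyTorch."""

    def __init__(self, shape):
        self.shape = tuple(shape)

    def unsqueeze(self, dim):
        # Insert a size-1 axis at position `dim`, like torch.Tensor.unsqueeze.
        shape = list(self.shape)
        shape.insert(dim, 1)
        return FakeTensor(shape)
```

With real tensors the same function applies unchanged, since it only relies on the `unsqueeze` method. Whether the demo should skip the audio pathway or extract audio from the video is a separate design decision that this guard does not make.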

haooooooqi · 2020-10-13T00:28:54Z

Hey @fanyix

Thanks for committing the code into PySlowFast. Have you trained the AVSlowFast model in the PySlowFast codebase? If so, what performance did you get?

Thanks,
Haoqi

fanyix · 2020-10-13T01:19:07Z

Hi Haoqi,

I haven't trained from scratch using PySlowFast. However, I did try converting the Caffe2 models and finetuning them with a short schedule in PySlowFast, and I got a gap of around 0.5 relative to the number achieved in the Caffe2 codebase. It's possible that my finetuning schedule is not optimized, so it would be great if you could try it on your end (or, even better, train from scratch).

SuX97 · 2020-11-26T07:40:21Z

slowfast/models/resnet_helper.py

+        self.t_relu = nn.ReLU(inplace=self._inplace_relu)
+
+        # 1x1x3, BN, ReLU.
+        self.f = nn.Conv3d(


Hi, I am trying to re-implement AVSlowFast and am confused by this part.

In the paper (Table 1), dim_inner should be the same as planes, but the code here sets dim_inner to 2 * planes. That is, for res2, the paper's [3×1, 1×3], 32 becomes [3×1, 1×3], 64 in the code.
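
To make the size of this discrepancy concrete, here is a small arithmetic sketch that counts the weights of a [3×1, 1×3] convolution pair at the two widths mentioned above. It assumes both convolutions keep the channel count unchanged, use no bias, and are not grouped; the function name `separable_conv_params` is hypothetical, and the actual layer in `resnet_helper.py` may differ.

```python
def separable_conv_params(width):
    """Weights in a [3x1, 1x3] conv pair with `width` channels in and out.

    Assumes no bias terms and no channel grouping.
    """
    conv_3x1 = 3 * 1 * width * width
    conv_1x3 = 1 * 3 * width * width
    return conv_3x1 + conv_1x3


planes = 32                 # res2 width quoted from Table 1 in the comment
paper_width = planes        # dim_inner == planes
code_width = 2 * planes     # dim_inner == 2 * planes, as described for the code

print(paper_width, separable_conv_params(paper_width))
print(code_width, separable_conv_params(code_width))
```

Under these assumptions, doubling the inner width quadruples the weights in the separable pair, so the two readings are not interchangeable when reproducing the reported model size.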

SuX97 · 2020-11-26T07:52:02Z

slowfast/models/video_model_builder.py

+                dim_inner // cfg.SLOWFAST.AU_BETA_INV
+            ],
+            temp_kernel_sizes=temp_kernel[1],
+            stride=[1] * 3,


Also, the stride here means that the stride of res1 of the audio pathway is 1. However, the paper says: "Downsampling in time-frequency space is performed by stride 2^2 convolution in the center (“bottleneck”) filter of the first residual block in each stage from res2 to res5." In the code, res2 does not seem to be included.
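
The practical effect of the two readings can be sketched by tracking the time-frequency size through res2 to res5. This is an illustration only: the 128×80 size entering res2 is a hypothetical value, and the 3-wide kernel with padding 1 is an assumption, not something taken from the code under review.

```python
def conv_out(size, stride, kernel=3, padding=1):
    """Output length of a convolution along one axis."""
    return (size + 2 * padding - kernel) // stride + 1


def stage_sizes(input_size, strides):
    """Return the (time, frequency) size after each stage."""
    time, freq = input_size
    sizes = []
    for stride in strides:
        time, freq = conv_out(time, stride), conv_out(freq, stride)
        sizes.append((time, freq))
    return sizes


# Hypothetical (time, frequency) size entering res2.
entering_res2 = (128, 80)

# Paper's description: stride 2 in every stage from res2 to res5.
paper = stage_sizes(entering_res2, [2, 2, 2, 2])
# Code under review, as read in the comment: stride 1 in res2.
code = stage_sizes(entering_res2, [1, 2, 2, 2])

for name, a, b in zip(["res2", "res3", "res4", "res5"], paper, code):
    print(name, "paper:", a, "code:", b)
```

Under these assumptions the code's feature maps are twice as large along each axis at every stage, which would change both compute cost and where the audio features align with the visual pathways.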

facebook-github-bot

@theschnitz has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

fyxiao added 11 commits June 11, 2020 15:55

update · ce6934c
update · 6e448ca
update · d35fb50
update AVSlowFast README · a5bb1a6
update · 9059be7
update · d3a3d3f
update · 2890bb0
Update Install guide for AVSlowFast · 591c407
Adding comments · 6cafa57
update · 33593a4
update · 6444255

facebook-github-bot added the CLA Signed label Jun 14, 2020

facebook-github-bot reviewed Jun 15, 2020

fanyix marked this pull request as ready for review August 11, 2020 20:09

Merge branch 'master' into master · 5ab2049
Update README.md · 629fd1b

SuX97 reviewed Nov 26, 2020

facebook-github-bot reviewed Dec 14, 2020

chongruo mentioned this pull request Oct 12, 2021

Audiovisual SlowFast Networks for Video Recognition #486

Open

haooooooqi mentioned this pull request Aug 3, 2022

About AudioVisual SlowFast Model #567

Closed
