On Adversarial Robustness of Large-scale Audio Visual Learning

repository for our ICASSP 2022 paper:http://arxiv.org/abs/2203.12122 also covers our ICASSP 2021 paper: https://arxiv.org/abs/2011.07430 (Audio-Visual Event Recognition through the lens of Adversary)

TL;DR: Watch our video at: https://www.youtube.com/watch?v=KQceFzZe7rg Slides: https://sigport.org/sites/default/files/docs/icassp2022_slides.pdf Brief intro to the pipeline:

You need 64x400 precomputed feature to run this pipeline, stored in .h5 file format, we are still figuring out where to host them. Our loader is optimized for this precomputed feature, for computing feature from .wav on the fly see our new implementation.

To train: see the tune.sh file and pick a model you want to train

To test: see checkPerformance-xxx.ipynb

This repo contains a lot of scrap material for our experiments, for training a model, we suggest you go to our newer version of implementation. The newer version of implementation can be found at: https://github.com/lijuncheng16/AudioTaggingDoneRight This repository is good for testing your pre-trained models.

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
Analysis		Analysis
audioset_strong_label		audioset_strong_label
--BfvyPmVMo.wav		--BfvyPmVMo.wav
.gitignore		.gitignore
AST.py		AST.py
Adversary_MMTLF-pgd-audio-rerun-Copy1.ipynb		Adversary_MMTLF-pgd-audio-rerun-Copy1.ipynb
Adversary_MMTLF-pgd-audio.ipynb		Adversary_MMTLF-pgd-audio.ipynb
AttackOcclude_AST_trained_elsewhere-Copy1.ipynb		AttackOcclude_AST_trained_elsewhere-Copy1.ipynb
AttackOcclude_AST_trained_elsewhere.ipynb		AttackOcclude_AST_trained_elsewhere.ipynb
AttackOcclude_TAL_trained_elsewhere.ipynb		AttackOcclude_TAL_trained_elsewhere.ipynb
AttackOcclude_TAL_trans_trained_elsewhere.ipynb		AttackOcclude_TAL_trans_trained_elsewhere.ipynb
AttackOcclude_TALtrans_trained_elsewhere.ipynb		AttackOcclude_TALtrans_trained_elsewhere.ipynb
AttackOcclude_resnet_no_pretrained_elsewhere.ipynb		AttackOcclude_resnet_no_pretrained_elsewhere.ipynb
AttackOcclude_resnet_trained_elsewhere.ipynb		AttackOcclude_resnet_trained_elsewhere.ipynb
AudioResNet.py		AudioResNet.py
ConvertFlac2Wav.py		ConvertFlac2Wav.py
HigherModels.py		HigherModels.py
Load_TAL_trained_elsewhere.ipynb		Load_TAL_trained_elsewhere.ipynb
Load_TALtrans_trained_elsewhere.ipynb		Load_TALtrans_trained_elsewhere.ipynb
Load_resnet_trained_elsewhere.ipynb		Load_resnet_trained_elsewhere.ipynb
Net_mModal.py		Net_mModal.py
Net_mModal_mgpu.py		Net_mModal_mgpu.py
README.md		README.md
TALNetModel.py		TALNetModel.py
ast_models_original.py		ast_models_original.py
checkPerformance-MMT.ipynb		checkPerformance-MMT.ipynb
checkPerformance-reproduce-early.ipynb		checkPerformance-reproduce-early.ipynb
checkPerformance-reproduce-video_only.ipynb		checkPerformance-reproduce-video_only.ipynb
checkPerformance-reproduce_AST.ipynb		checkPerformance-reproduce_AST.ipynb
checkPerformance-reproduce_AST_trained_elsewhere.ipynb		checkPerformance-reproduce_AST_trained_elsewhere.ipynb
checkPerformance-reproduce_CNNtrans-mpgu.ipynb		checkPerformance-reproduce_CNNtrans-mpgu.ipynb
checkPerformance-reproduce_CNNtrans.ipynb		checkPerformance-reproduce_CNNtrans.ipynb
checkPerformance-reproduce_CRNN.ipynb		checkPerformance-reproduce_CRNN.ipynb
checkPerformance-reproduce_MMTLF.ipynb		checkPerformance-reproduce_MMTLF.ipynb
checkPerformance-reproduce_resnet.ipynb		checkPerformance-reproduce_resnet.ipynb
checkPerformance-reproduce_resnet34.ipynb		checkPerformance-reproduce_resnet34.ipynb
checkPerformance-reproduce_resnet50_mgpu.ipynb		checkPerformance-reproduce_resnet50_mgpu.ipynb
eval_new.py		eval_new.py
min_max_values.pkl		min_max_values.pkl
model_video.py		model_video.py
normalizer.pkl		normalizer.pkl
pslaModels.py		pslaModels.py
strong_label_masking.ipynb		strong_label_masking.ipynb
train_multimodal_late_fusion-resnet.py		train_multimodal_late_fusion-resnet.py
train_multimodal_late_fusion.py		train_multimodal_late_fusion.py
tune.sh		tune.sh
util_f1.py		util_f1.py
util_in_multi_h5_unnorm.py		util_in_multi_h5_unnorm.py
util_out.py		util_out.py
util_plot.py		util_plot.py
video_stats.pkl		video_stats.pkl
vis_feature-Copy1.ipynb		vis_feature-Copy1.ipynb

lijuncheng16/AudioSetDoneRight

Folders and files

Latest commit

History

Repository files navigation

On Adversarial Robustness of Large-scale Audio Visual Learning

About

Resources

Stars

Watchers

Forks

Languages