Reference

WALNet Weak Label Analysis

This is an implementation for the paper "A Closer Look at Weak Label Learning for Audio Events". In this paper, we attempt to understand the challenges of large scale Audio Event Detection (AED) using weakly labeled data through a CNN based framework. Our network architecture is capable of handling variable length recordings and architecture design provides a way to control segment size of adjustable secondary outputs and thus these features eliminate the need for additional preprocessing steps. We look into how label density and label corruption affects performance and further compare mined web data as training data in comparison with manually labelled training data from AudioSet. We believe our work provides an approach to understand the challenges of weakly labeled learning and future AED works would benefit from our exploration.

We provide the Audioset data (list of files used in our experimentation) provided for reproducibility.

WALNet Architecture Diagram

Reference

If you use our repository or feature representation for your research WALNet- weak label analysis, please cite our paper:

@article{shah2018closer,
title={A Closer Look at Weak Label Learning for Audio Events},
author={Shah, Ankit and Kumar, Anurag and Hauptmann, Alexander G and Raj, Bhiksha},
journal={arXiv preprint arXiv:1804.09288},
year={2018}
}

Latest Results -

Use these numbers while reporting - arXiV paper update coming soon as version 2

Training Set	MAP on Testing
AudioSet - 10	22.87
AudioSetAt30	22.42
AudioSetAt60	22.42

Mean Average Precision on Audioset - 10 second recordings

Model	MAP
ConvNet (mean pooling)	20.3
ResNet (mean pooling)	21.8
ResNet-ATT [Xu et al., 2017a]	22.0
ResNet-SPDA [Zhang et al., 2016]	21.9
Mmnet [Chou et al., 2018]	22.6
WALNet [Shah.et.al, 2018]	22.9

ESC-50 dataset	MAP
SoundNet	74.2
WALNet	83.5

Questions

Contact Ankit Shah (aps1@andrew.cmu.edu) or Anurag Kumar (alnu@andrew.cmu.edu)

Name		Name	Last commit message	Last commit date
Latest commit History 65 Commits
classifier		classifier
download		download
feature_extraction		feature_extraction
lists		lists
model		model
utils		utils
LICENSE		LICENSE
README.md		README.md
WALNet_Architecture_DIagram.jpg		WALNet_Architecture_DIagram.jpg
walnet.yml		walnet.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

classifier

classifier

download

download

feature_extraction

feature_extraction

lists

lists

model

model

utils

utils

LICENSE

LICENSE

README.md

README.md

WALNet_Architecture_DIagram.jpg

WALNet_Architecture_DIagram.jpg

walnet.yml

walnet.yml

Repository files navigation

WALNet Weak Label Analysis

WALNet Architecture Diagram

Web Page for More Details on Experimentation: -

Feature Level Representation - Download here

Embedding level feature representation - Download here

Reference

Latest Results -

Use these numbers while reporting - arXiV paper update coming soon as version 2

Mean Average Precision on Audioset - 10 second recordings

Questions

About

Releases

Packages

Contributors 3

Languages

License

ankitshah009/WALNet-Weak_Label_Analysis

Folders and files

Latest commit

History

Repository files navigation

WALNet Weak Label Analysis

WALNet Architecture Diagram

Web Page for More Details on Experimentation: -

Feature Level Representation - Download here

Embedding level feature representation - Download here

Reference

Latest Results -

Use these numbers while reporting - arXiV paper update coming soon as version 2

Mean Average Precision on Audioset - 10 second recordings

Questions

About

Resources

License

Stars

Watchers

Forks

Languages