Google's AudioSet consistently reformatted
-
Updated
Sep 21, 2022 - Python
Google's AudioSet consistently reformatted
Re-Implementation of Google Research's VGGish model used for extracting audio features using Pytorch with GPU support.
Source code for "Visually aligned sound generation via sound-producing motion parsing" (Published at Neurocomputing)
Repo accompanying the blog post "How to Deploy PyTorch Models with Core ML Conversion Issues"
Annotation for Google AudioSet Screaming Events (Balanced train videos)
Scripts to process Google's Audioset
Download audioset data super fastly with youtube-dl, ffmpeg and python multiprocessing
Codebase for Imperial MSc AI Individual Project - Self-Supervised Learning for Audio Inference
[AAAI 2023 (Oral)] CrissCross: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity
Repo accompanying the blog post "How to Deploy A State-of-the-art PyTorch Model to iOS via Core ML (Part 3)".
AudioSet classification using RNN
Query service to serve the JibJib TensorFlow model
Gender prediction in movie audio
Continual Learning with Gated Incremental Memories for Sequential Data Processing. IJCNN 2020. Continual Learning with Recurrent Neural Networks (RNNs) inspired by Progressive network architecture.
🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.
The unofficial implementation of paper, "Objects that Sound", from ECCV 2018.
Spoken Language Identification on Common Voice and AudioSet using Deep Learning
Machine learning model for bird songs recognition
Add a description, image, and links to the audioset topic page so that developers can more easily learn about it.
To associate your repository with the audioset topic, visit your repo's landing page and select "manage topics."