Pytorch port of Google Research's VGGish model used for extracting audio features.
-
Updated
Nov 3, 2021 - Python
Pytorch port of Google Research's VGGish model used for extracting audio features.
Audio classification with VGGish as feature extractor in TensorFlow
📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
Unofficial Implementation of Google Deepmind's paper `Objects that Sound`
📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).
This package aims at simplifying the download of the AudioSet dataset.
Sound augmentation using Large-scale audio dataset (Audioset)
A library built for easier audio self-supervised training, downstream tasks evaluation
Spoken Language Identification on Common Voice and AudioSet using Deep Learning
Machine learning model for bird songs recognition
The unofficial implementation of paper, "Objects that Sound", from ECCV 2018.
Gender prediction in movie audio
Continual Learning with Gated Incremental Memories for Sequential Data Processing. IJCNN 2020. Continual Learning with Recurrent Neural Networks (RNNs) inspired by Progressive network architecture.
🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.
[AAAI 2023 (Oral)] CrissCross: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity
Repo accompanying the blog post "How to Deploy A State-of-the-art PyTorch Model to iOS via Core ML (Part 3)".
AudioSet classification using RNN
Query service to serve the JibJib TensorFlow model
Download audioset data super fastly with youtube-dl, ffmpeg and python multiprocessing
Add a description, image, and links to the audioset topic page so that developers can more easily learn about it.
To associate your repository with the audioset topic, visit your repo's landing page and select "manage topics."