Annotation for Google AudioSet Screaming Events (Balanced train videos)
-
Updated
Nov 18, 2017
Annotation for Google AudioSet Screaming Events (Balanced train videos)
Gender prediction in movie audio
Source code for "Visually aligned sound generation via sound-producing motion parsing" (Published at Neurocomputing)
Google's AudioSet consistently reformatted
Re-Implementation of Google Research's VGGish model used for extracting audio features using Pytorch with GPU support.
Scripts to process Google's Audioset
Query service to serve the JibJib TensorFlow model
AudioSet classification using RNN
Codebase for Imperial MSc AI Individual Project - Self-Supervised Learning for Audio Inference
Repo accompanying the blog post "How to Deploy PyTorch Models with Core ML Conversion Issues"
Continual Learning with Gated Incremental Memories for Sequential Data Processing. IJCNN 2020. Continual Learning with Recurrent Neural Networks (RNNs) inspired by Progressive network architecture.
Repo accompanying the blog post "How to Deploy A State-of-the-art PyTorch Model to iOS via Core ML (Part 3)".
Download audioset data super fastly with youtube-dl, ffmpeg and python multiprocessing
[AAAI 2023 (Oral)] CrissCross: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity
📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).
The unofficial implementation of paper, "Objects that Sound", from ECCV 2018.
Spoken Language Identification on Common Voice and AudioSet using Deep Learning
Add a description, image, and links to the audioset topic page so that developers can more easily learn about it.
To associate your repository with the audioset topic, visit your repo's landing page and select "manage topics."