Open-source Audio Datasets

What is DagsHub?

DagsHub is a centralized platform to host and manage machine learning projects including code, data, models, experiments, annotations, model registry, and more! DagsHub does the MLOps heavy lifting for its users. Every repository comes with configured S3 storage, an experiment tracking server, and an annotation workspace - all using popular open-source tools like MLflow, DVC, Git, and Label Studio.

What is Hacktoberfest?

Hacktoberfest is a month-long virtual festival of open source! Participants are giving back to the community by completing pull requests, participating in events, and donating to open-source projects. This project is part of Hacktoberfest 2023, where participants enrich the open-source audio datasets hosted on DagsHub.

Quick Start to Contribution

Sign-up to Hacktoberfest & DagsHub.
Join our Hacktoberfest 2022 Discord channel.
Read the contribution guide lines.
Create a Pull Requests on the GitHub audio-datasets repository.

What does the DagsHub community contribute?

This year we'd like to focus our contribution on the audio domain. For that, we added audio data catalog capabilities to DagsHub! You can now upload audio files to DagsHub and see its spectrogram, wave, and even listen to it! You can see a vivid example of this (extremely cool) feature in our Librispeech-ASR-corpus project.

To help audio practitioners leverage this new feature, we want to enrich open-source audio datasets on DagsHub. This is where you can contribute to the data science community!

How to contribute?

Claim the dataset you wish to contribute from the list (KUDOS to jim-schwoebel) by opening a new issue on the GitHub repository and name it after the dataset. Please make sure that the dataset wasn't claimed.
Open a new DagsHub repository and upload the data to its DVC storage (e.g., dataset repository).
Write information about the dataset in the README file (e.g., Librispeech ASR corpus README).
Add relevant tags to the repository and files.
Add the following labels to the repository:
- dataset
- audio
- hacktoberfest
In the GitHub audio-datasets project:
- Open a new branch named after the dataset.
- Add a directory named after the dataset with the README file.
- Commit and push the changes to GitHub.
- Create a pull request on GitHub.
Optional: Share the project on DagsHub Hacktoberfest 2022 Discord channel.

Name		Name	Last commit message	Last commit date
Latest commit History 153 Commits
Acted-Emotional-Speech-Dynamic-Database		Acted-Emotional-Speech-Dynamic-Database
Arabic-Speech-Corpus		Arabic-Speech-Corpus
Att-HACK		Att-HACK
AudioMNIST		AudioMNIST
BAVED		BAVED
Bird-Audio-Detection-challenge		Bird-Audio-Detection-challenge
CHiME-Home		CHiME-Home
CMU-MOSI		CMU-MOSI
CREMA-D		CREMA-D
CSD		CSD
CommonVoice		CommonVoice
Coswara		Coswara
DAPS		DAPS
Deeply_Nonverbal_Vocalization_Dataset		Deeply_Nonverbal_Vocalization_Dataset
EMODB		EMODB
EMOVO		EMOVO
ESC-50		ESC-50
EmoSynth		EmoSynth
Estonian-Emotional-Speech-Corpus		Estonian-Emotional-Speech-Corpus
FSDnoisy18k		FSDnoisy18k
FSL4		FSL4
Flickr-Audio-Caption-Corpus		Flickr-Audio-Caption-Corpus
Golos		Golos
JL-corpus		JL-corpus
LEGO-Spoken-Dialogue-Corpus		LEGO-Spoken-Dialogue-Corpus
LJ-Speech-Dataset		LJ-Speech-Dataset
MS-SNSD		MS-SNSD
Public Domain Sounds		Public Domain Sounds
RSC		RSC
Speech-Accent-Dataset		Speech-Accent-Dataset
Speech_Commands_Dataset		Speech_Commands_Dataset
TESS		TESS
URDU-Dataset		URDU-Dataset
UrbanSound8K		UrbanSound8K
VIVAE		VIVAE
WARBLRB10k		WARBLRB10k
assets		assets
free-spoken-digit-dataset		free-spoken-digit-dataset
lego-spoken-dialogue-corpus		lego-spoken-dialogue-corpus
musdb18-musdb18hq		musdb18-musdb18hq
voice_gender_detection		voice_gender_detection
zerospeech2021		zerospeech2021
CONTRIBUTING.md		CONTRIBUTING.md
README.md		README.md

DagsHub/audio-datasets

Folders and files

Latest commit

History

Repository files navigation

Open-source Audio Datasets

What is DagsHub?

What is Hacktoberfest?

Quick Start to Contribution

What does the DagsHub community contribute?

How to contribute?

About

Topics

Resources

Stars

Watchers

Forks