Skip to content

Navigation Menu

Explore
For
- Enterprise
- Teams
- Startups
- Education
By Solution
Resources
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

juliagusak / dataloaders Public

Notifications You must be signed in to change notification settings
Fork 12
Star 109

Code
Issues 1
Pull requests
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Breadcrumbs

dataloaders

/

README.md

Latest commit

History

41 lines (36 loc) · 3.39 KB

Breadcrumbs

dataloaders

/

README.md

File metadata and controls

41 lines (36 loc) · 3.39 KB

dataloaders

Pytorch and TFRecords data loaders for several audio datasets

Datasets

ESC - dataset of environmental sounds

ESC Downloader
Pytorch DataSet
TFRecords Loader

LibriSpeech - corpus of read English speech

LibriSpeech downloader for PyTorch
PyTorch DataSet
PyTorch DataSet for TFRecord
PyTorch DataLoaders for TFRecord
TFRecords Loader
TFRecords Generator

NSynth - dataset of annotated musical notes

NSynth downloader and generator of *.h5py and *.tfrecord formats
TFRecord reader
PyTorch Dataset
PyTorch Dataset for TFrecord
PyTorch DataLoaders for TFRecord

VoxCeleb2 - human speech, extracted from YouTube interview videos

Pytorch loader
TFRecords loader

GTZAN - audio tracks from a variety of sources annotated with genre class

GTZAN Downloader
PyTorch DataSet

CallCenter - audio tracks with human and non-human speech

PyTorch DataSet

For validation we frequently use the following scheme:

Read 10 random crops from a file;
Predict a class for each crop;
Averaging results.

For this scheme we've done additional DataLoaders for PyTorch:

DataLoader for ESC, GTZAN, LibriSpeech
DataLoader for LibriSpeech from TfRecords
DataLoaders for NSynth

Footer

© 2024 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.