Adding loader for Hainsworth dataset #617

tanmayy24 · 2024-01-22T21:40:34Z

Please include the following information at the top level docstring for the dataset's module mydataset.py:

Describe annotations included in the dataset
Indicate the size of the datasets (e.g. number files and duration, hours)
Mention the origin of the dataset (e.g. creator, institution)
Describe the type of music included in the dataset
Indicate any relevant papers related to the dataset
Include a description about how the data can be accessed and the license it uses (if applicable)

Dataset loaders checklist:

Create a script in scripts/, e.g. make_my_dataset_index.py, which generates an index file.
Run the script on the canonical version of the dataset and save the index in mirdata/indexes/ e.g. my_dataset_index.json.
Create a module in mirdata, e.g. mirdata/my_dataset.py
Create tests for your loader in tests/datasets/, e.g. test_my_dataset.py
Add your module to docs/source/mirdata.rst and docs/source/table.rst
Run black, flake8 and mypy (see Running your tests locally).
Run tests/test_full_dataset.py on your dataset.
Check that codecov coverage does not decrease.

If your dataset is not fully downloadable there are two extra steps you should follow:

Contacting the mirdata organizers by opening an issue or PR so we can discuss how to proceed with the closed dataset.
Show that the version used to create the checksum is the "canonical" one, either by getting the version from the dataset creator, or by verifying equivalence with several other copies of the dataset.
Make sure someone has run pytest -s tests/test_full_dataset.py --local --dataset my_dataset once on your dataset locally and confirmed it passes.

Please-do-not-edit flag

To reduce friction, we will make commits on top of contributor's pull requests by default unless they use the please-do-not-edit flag. If you don't want this to happen don't forget to add the flag when you start your pull request.

codecov · 2024-01-22T21:44:31Z

Codecov Report

Merging #617 (d6e709a) into master (c1e3cf9) will increase coverage by 0.00%.
The diff coverage is 98.03%.

Additional details and impacted files

@@           Coverage Diff           @@
##           master     #617   +/-   ##
=======================================
  Coverage   97.07%   97.07%           
=======================================
  Files          63       64    +1     
  Lines        7341     7392   +51     
=======================================
+ Hits         7126     7176   +50     
- Misses        215      216    +1

Adding loader for hainsworth

b7f6006

tanmayy24 changed the title ~~[WIP] Adding loader for Hainsworth Dataset~~ [WIP] Adding loader for Hainsworth dataset Jan 22, 2024

rythmm24 added 3 commits January 22, 2024 17:02

Edited loader info

f8aa296

Update in audio count

4120fef

Update in doc for Hainsworth

97eb175

tanmayy24 changed the title ~~[WIP] Adding loader for Hainsworth dataset~~ Adding loader for Hainsworth dataset Jan 25, 2024

Merge branch 'master' into tanmay/hainsworth

d6e709a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding loader for Hainsworth dataset #617

Adding loader for Hainsworth dataset #617

tanmayy24 commented Jan 22, 2024 •

edited

codecov bot commented Jan 22, 2024 •

edited

Adding loader for Hainsworth dataset #617

Are you sure you want to change the base?

Adding loader for Hainsworth dataset #617

Conversation

tanmayy24 commented Jan 22, 2024 • edited

Dataset loaders checklist:

Please-do-not-edit flag

codecov bot commented Jan 22, 2024 • edited

Codecov Report

tanmayy24 commented Jan 22, 2024 •

edited

codecov bot commented Jan 22, 2024 •

edited