asigalov61/Los-Angeles-MIDI-Dataset

Los Angeles MIDI Dataset

SOTA kilo-scale MIDI dataset for MIR and Music AI purposes




Search and Explore Los Angeles MIDI dataset

Open In Colab


[NEW] Master MIDI Dataset GPU Search and Filter

Open In Colab


Master MIDI Dataset Search and Filter

Open In Colab


Make your own Los Angeles MIDI Dataset from any MIDI scrape

Open In Colab


Make your own Los Angeles MIDI Dataset Metadata

Open In Colab



Main Features:

1) ~405000 unique MIDIs to explore :)

2) Each MIDI file was read-checked and 100% de-duped

3) Extensive metadata for each MIDI file

4) Full chords data for each MIDI file

5) Helper Python code
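
The read-check and de-dupe step from feature 2 can be illustrated with a minimal sketch. This is not the dataset's actual pipeline: it assumes that "read-checked" means at least a valid Standard MIDI File header, and that "de-duped" means byte-identical files are collapsed by hash. The function names `is_readable_midi`, `dedupe_blobs`, and `dedupe_midis` are hypothetical and are not part of the dataset's helper code.

```python
import hashlib
from pathlib import Path

MIDI_MAGIC = b"MThd"  # chunk ID that opens every Standard MIDI File


def is_readable_midi(data: bytes) -> bool:
    """Cheap read check (assumption): the file starts with the MThd chunk ID
    and is long enough to hold the 14-byte header chunk."""
    return len(data) >= 14 and data[:4] == MIDI_MAGIC


def dedupe_blobs(named_blobs):
    """Return {sha256_hex: first_name} for readable, byte-unique MIDI blobs.

    named_blobs is an iterable of (name, bytes) pairs. Only exact byte
    duplicates are collapsed; musically identical files with different
    bytes are kept.
    """
    unique = {}
    for name, data in named_blobs:
        if not is_readable_midi(data):
            continue  # skip malformed or non-MIDI files
        digest = hashlib.sha256(data).hexdigest()
        unique.setdefault(digest, name)  # keep the first file seen per hash
    return unique


def dedupe_midis(root):
    """Apply dedupe_blobs to every *.mid file found under the root folder."""
    paths = sorted(Path(root).rglob("*.mid"))
    return dedupe_blobs((path, path.read_bytes()) for path in paths)
```

Calling `dedupe_midis("my_midi_scrape")` on a hypothetical folder would return one path per unique file. Note that the `*.mid` pattern is case-sensitive on most Linux filesystems, so a real scrape may also need `*.MID` and `*.midi`.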


NEW in version 4.0

1) Added 160519 new unique MIDIs

2) Dataset now contains 404714 MIDIs

3) Removed all malformed MIDIs

4) Expanded metadata for the dataset MIDIs

5) Added a chords database for the MIDIs

6) Updated dataset concept artwork

Enjoy! :)


@inproceedings{lev2024losangelesmididataset,
    title       = {Los Angeles MIDI Dataset: SOTA kilo-scale MIDI dataset for MIR and Music AI purposes},
    author      = {Aleksandr Lev},
    booktitle   = {GitHub},
    year        = {2024},
}

Project Los Angeles

Tegridy Code 2024