Jingju Singing Phrase Matching

The code in this repo aims to help reproduce the results in the work:

Rong Gong, Jordi Pons, and Xavier Serra. 2017. Audio to Score Matching by Combining Phonetic and Duration Information. In 18th International Society for Music Information Retrieval Conference. Suzhou, China

The objective of this research task to find the corresponding score for its singing query audio. By pre-segmenting both the singing audios and the music scores into the phrase units, we restrict this research to the "matching" scope. The matched scores could facilitate several lower-level MIR tasks, such as the score-informed automatic syllable or phoneme segmentation for singing voice.

The related code only situated in phoneticSimilarity folder. Other folders are used to test other matching methods.

Steps to reproduce the experiment results

Clone this repository
Download Jingju a capella singing dataset from http://doi.org/10.5281/zenodo.344932
Change dataset_path variable in general/filePath.py to locate the above dataset
Compile cython viterbi decoding code: go into CythonModule, in terminal type python setup.py build_ext --inplace
Python 2.7.9 and Essentia 2.1-beta 3 were used in the paper; Install dependencies in requirements.txt
Choose class_name in general/filePath.py to 'danAll' or 'laosheng' to experiment on either dan or laosheng role-type
Choose am in general/parameters.py to 'gmm' or 'cnn' to select acoustic model
Run python runHMM.py to produce the experiment results for HMM and post-processor duration modelling matching
Run python runHSMM.py to produce the experiment results for HSMM duration modelling matching
The details instructions are written in these two files above

Steps to train GMM, CNN acoustic models

Do steps 1, 2, 3 in Steps to reproduce the experiment results
To train GMM models, run python acousticModelTraining.py
To train CNN acoustic models, please follow the instructions in https://github.com/ronggong/EUSIPCO2017
After training CNN models, put them into cnnModels folders. Pre-trained models are already included.

Dependencies

numpy scipy matplotlib essentia vamp scikit-learn pinyin cython keras theano unicodecsv

License

Affero GNU General Public License version 3

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.idea		.idea
CythonModule		CythonModule
general		general
melodicSimilarity		melodicSimilarity
phoneticSimilarity		phoneticSimilarity
roleTypeClassification		roleTypeClassification
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.idea

.idea

CythonModule

CythonModule

general

general

melodicSimilarity

melodicSimilarity

phoneticSimilarity

phoneticSimilarity

roleTypeClassification

roleTypeClassification

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

requirements.txt

requirements.txt

Repository files navigation

Jingju Singing Phrase Matching

Steps to reproduce the experiment results

Steps to train GMM, CNN acoustic models

Dependencies

License

About

Releases 1

Packages

Languages

License

ronggong/jingjuSingingPhraseMatching

Folders and files

Latest commit

History

Repository files navigation

Jingju Singing Phrase Matching

Steps to reproduce the experiment results

Steps to train GMM, CNN acoustic models

Dependencies

License

About

Topics

Resources

License

Stars

Watchers

Forks

Languages