This repository contains code for the paper "Generative Modelling for Controllable Audio Synthesis of Piano Performance" by Hao Hao Tan, Yin-Jyun Luo and Dorien Herremans.
We utilize Gaussian Mixture VAEs in neural audio synthesis models to allow temporal conditioning of two essential style features for piano performances: articulation and dynamics.
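The core idea of a Gaussian mixture prior is that each discrete style class (e.g. a degree of articulation or dynamics) owns one Gaussian component, so conditioning amounts to choosing a component before sampling the latent. A minimal sketch of that sampling step, with toy (randomly initialized) parameters rather than the paper's trained model:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical GM-VAE prior: K mixture components, one per discrete
# style class. Means/variances are random stand-ins for learned values.
K, D = 4, 8                        # number of components, latent dimension
means = rng.normal(size=(K, D))    # component means (learned in practice)
log_vars = np.zeros((K, D))        # component log-variances

def sample_prior(k):
    """Draw a latent z from mixture component k (reparameterised)."""
    eps = rng.normal(size=D)
    return means[k] + np.exp(0.5 * log_vars[k]) * eps

# Conditioning on a style class = picking its component before sampling.
z = sample_prior(2)
print(z.shape)  # (8,)
```

Varying `k` over time is what enables temporal conditioning: each frame's latent can be drawn from a different style component.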
- Download the MAESTRO v2.0.0 dataset.
- Modify the training configurations in `nms_latent_config.json`.
- Run `python trainer_nms_latent_dynamic.py`.
- The trained model weights and logs can be found in the `params/` and `logs/` folders respectively.
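To adjust hyperparameters before launching the trainer, edit the JSON config. A small sketch of that step; the key names below are illustrative only, so check the shipped `nms_latent_config.json` for the actual fields:

```python
import json

# A hypothetical slice of nms_latent_config.json -- the real file ships
# with the repo; the key names here are assumptions for illustration.
cfg_text = '{"batch_size": 32, "learning_rate": 0.001, "epochs": 100}'
cfg = json.loads(cfg_text)

# Tweak hyperparameters before running trainer_nms_latent_dynamic.py.
cfg["batch_size"] = 16
cfg["learning_rate"] = 1e-4
print(json.dumps(cfg, indent=2))
```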
After training completes, follow `visualize.ipynb` to observe the controllable generation of spectrograms under different degrees of articulation and dynamics.
For details on training WaveGlow, please refer to: https://github.com/yjlolo/constant-memory-waveglow
This research work is published at the ICML ML4MD Workshop, 2020.
```
@inproceedings{tan20generative,
  author    = {Tan, Hao Hao and Luo, Yin-Jyun and Herremans, Dorien},
  booktitle = {ICML Workshop on Machine Learning for Music Discovery Workshop (ML4MD), Extended Abstract},
  title     = {Generative Modelling for Controllable Audio Synthesis of Piano Performance},
  year      = {2020}
}
```