Audio inpainting with a context encoder

This project accompanies the research on audio inpainting of small gaps carried out at the Acoustics Research Institute in Vienna in collaboration with the Swiss Data Science Center. The paper was published in IEEE TASLP and is available at https://ieeexplore.ieee.org/document/8867915.

Installation

Install the requirements with pip install -r requirements.txt. For Windows users, the numpy version should be 1.14.0+mkl (find it here). For the FMA dataset, librosa requires ffmpeg as an mp3 backend.
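Before building the FMA dataset, it can help to confirm that ffmpeg is reachable. The snippet below is a minimal sketch, not part of this repository: the helper name ffmpeg_available is hypothetical, and it assumes the mp3 backend locates ffmpeg through the system PATH.

```python
import shutil


def ffmpeg_available():
    """Return True if an ffmpeg executable can be found on the PATH."""
    # shutil.which returns the full path of the executable, or None.
    return shutil.which("ffmpeg") is not None


if __name__ == "__main__":
    if ffmpeg_available():
        print("ffmpeg found on PATH")
    else:
        print("ffmpeg not found: install it before running make_fmadataset.py")
```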

Instructions

The paper uses both Google's NSynth dataset and the FMA dataset. To recreate the datasets used, execute in the parent folder either python make_nsynthdataset.py or python make_fmadataset.py. Each script outputs three tfrecord files, used for training, validating and testing the model.

The default parameters for the networks come pickled in the files magnitude_network_parameters.pkl and complex_network_parameters.pkl. To define other architectures, use saveParameters.py.
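To see what a parameter file contains, it can be opened with the standard pickle module. This is a rough sketch under assumptions: the helper load_parameters is hypothetical, the structure of the stored object is not documented here, and unpickling may require the repository's own modules to be importable, so run it from the repository root.

```python
import pickle


def load_parameters(path):
    """Load a pickled parameter object from `path` and return it."""
    with open(path, "rb") as f:
        return pickle.load(f)


if __name__ == "__main__":
    # Assumption: the working directory is the repository root.
    params = load_parameters("magnitude_network_parameters.pkl")
    # The object's layout is not assumed here; inspect it interactively.
    print(type(params))
```

Only load pickle files from sources you trust, since unpickling can execute arbitrary code.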

To train the network, execute in the parent folder python trainMagnitudeNetwork.py or python trainComplexNetwork.py. This trains the network for 600k steps with a learning rate of 1e-3. You can select which tfrecords to train on; the scripts assume you have created the NSynth dataset.

Sound examples

To hear examples please go to the accompanying website.

