Audio Source Separation based on Wave-U-Net

Wave-U-Net Model

Paper Source : https://arxiv.org/abs/1806.03185

The Model Consists of 1D Convolutional layers with U-Net Architechture

The Encoder Layers are downsampled by decimation which is a general method to reduce the sampling rate of an audio data

The Decoder Layers are upsampled by simple linear interpolation with sigmoid activation function for smoothing

Take a look on the model's architechture in the paper :

Limitations

The model's performance is not as good as the other extra large Audio Source Separation models like OpenUnMix and HDEMUCS. Despite of that, this model is very feasible to run locally on mobile application after applying some optimization for mobile apps like Quantization and Pruning considering the relatively small size of the model (compared to other models).

MUSDB Dataset

The MUSDB Dataset provides a collection of multitrack music recordings specifically designed for source separation research. It consists of a diverse musical genre and provides supervised ground truth annotations for each track which is delivered in isolated sources (vocals, drums, bass, accompaniment, and others)

This Dataset is quite well-known among the researchers and practicioners in the field of Audio Signal Processing and Source Separation.

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
media		media
src		src
.gitignore		.gitignore
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

media

media

src

src

.gitignore

.gitignore

README.md

README.md

main.py

main.py

requirements.txt

requirements.txt

Repository files navigation

Audio Source Separation based on Wave-U-Net

Wave-U-Net Model

Paper Source : https://arxiv.org/abs/1806.03185

Limitations

MUSDB Dataset

About

Releases

Packages

Languages

Jonathanjordan21/Audio-Source-Separation

Folders and files

Latest commit

History

Repository files navigation

Audio Source Separation based on Wave-U-Net

Wave-U-Net Model

Paper Source : https://arxiv.org/abs/1806.03185

Limitations

MUSDB Dataset

About

Topics

Resources

Stars

Watchers

Forks

Languages