Skip to content

Testing The Efficiency of WaveUNet as an Audio Source Separation model

Notifications You must be signed in to change notification settings

Jonathanjordan21/Audio-Source-Separation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Audio Source Separation based on Wave-U-Net

Wave-U-Net Model

The Model Consists of 1D Convolutional layers with U-Net Architechture Kiku

The Encoder Layers are downsampled by decimation which is a general method to reduce the sampling rate of an audio data Kiku

The Decoder Layers are upsampled by simple linear interpolation with sigmoid activation function for smoothing

Take a look on the model's architechture in the paper :
Kiku

Limitations

The model's performance is not as good as the other extra large Audio Source Separation models like OpenUnMix and HDEMUCS. Despite of that, this model is very feasible to run locally on mobile application after applying some optimization for mobile apps like Quantization and Pruning considering the relatively small size of the model (compared to other models).

MUSDB Dataset

The MUSDB Dataset provides a collection of multitrack music recordings specifically designed for source separation research. It consists of a diverse musical genre and provides supervised ground truth annotations for each track which is delivered in isolated sources (vocals, drums, bass, accompaniment, and others) Kiku

This Dataset is quite well-known among the researchers and practicioners in the field of Audio Signal Processing and Source Separation.

About

Testing The Efficiency of WaveUNet as an Audio Source Separation model

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages