
audiolm-pytorch-training

This repository contains my scripts for training AudioLM models, using the audiolm-pytorch library by lucidrains together with my own modifications in the personal_hacks branch. The library is a mostly 1-to-1 reproduction of the paper "AudioLM: a Language Modeling Approach to Audio Generation".
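For orientation, training one stage (here the semantic transformer) through the upstream audiolm-pytorch API looks roughly like the sketch below. The checkpoint, k-means, and dataset paths are placeholders, and the training scripts in this repository may configure things differently; treat this as a minimal sketch rather than the exact code used here.

from audiolm_pytorch import HubertWithKmeans, SemanticTransformer, SemanticTransformerTrainer

# semantic tokens come from a pretrained HuBERT + k-means model (placeholder paths)
wav2vec = HubertWithKmeans(
    checkpoint_path='./hubert/hubert_base_ls960.pt',
    kmeans_path='./hubert/hubert_base_ls960_L9_km500.bin'
)

semantic_transformer = SemanticTransformer(
    num_semantic_tokens=wav2vec.codebook_size,
    dim=1024,
    depth=6
)

trainer = SemanticTransformerTrainer(
    transformer=semantic_transformer,
    wav2vec=wav2vec,
    folder='./audiolm-pytorch-datasets/LibriSpeech-dev-clean',  # any folder of audio files
    batch_size=1,
    data_max_length=320 * 32,
    num_train_steps=1
)

trainer.train()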

Getting started

# Create necessary directories
export folder_name=itsleonwu # rename this for your own purposes
mkdir /fsx/$folder_name 
cd /fsx/$folder_name 
mkdir audiolm-pytorch-results
mkdir audiolm-pytorch-datasets

# Clone the audiolm-pytorch-training repository
git clone https://github.com/LWprogramming/audiolm-pytorch-training.git

# Create a virtual environment using Python 3.10 and activate it
cd audiolm-pytorch-training
python3.10 -m venv venv
source venv/bin/activate

# Download the HuBERT checkpoints used for semantic token extraction
python hubert_ckpt_download.py

# Install the patched audiolm-pytorch from the personal_hacks branch
python use_patched_audiolm.py personal_hacks

# tensorboardX is not installed by the previous steps, so install it separately
pip install tensorboardX
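tensorboardX is presumably used for logging training metrics; a minimal usage sketch (the log directory name here is arbitrary) looks like this:

from tensorboardX import SummaryWriter

writer = SummaryWriter('runs/audiolm-overfit')  # logs land under this directory
writer.add_scalar('train/loss', 0.5, global_step=0)
writer.close()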

# Download the dataset
cd ../audiolm-pytorch-datasets
wget https://www.openslr.org/resources/12/dev-clean.tar.gz
tar -xvf dev-clean.tar.gz
mv LibriSpeech LibriSpeech-dev-clean
rm dev-clean.tar.gz
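As a quick sanity check (the path assumes the layout created above; adjust folder_name for your own setup), the extracted dataset should contain FLAC files for the trainers to pick up:

from pathlib import Path

dataset = Path('/fsx') / 'itsleonwu' / 'audiolm-pytorch-datasets' / 'LibriSpeech-dev-clean'
flac_files = list(dataset.rglob('*.flac'))  # LibriSpeech ships its audio as FLAC
print(f'found {len(flac_files)} flac files under {dataset}')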

# Create a directory for the sample file
mkdir many_identical_copies_of_cocochorales_single_sample_resampled_24kHz_trimmed_first_second
echo "Remember to upload the sample file to this overfitting dataset!"
