thomasvvugt/whisperx

This repository provides WhisperX in a Docker image.

The image installs Python 3.10 on an Ubuntu 22.10 base image, along with prerequisites such as PyTorch.

Using the image

WhisperX can be run on CPU or GPU, using either of the following Docker images.

CPU-only

docker run -it --rm -v /path/to/audio_files:/app thomasvvugt/whisperx:cpu recording.mp3 --batch_size 4 --compute_type int8
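To transcribe several recordings in one go, the CPU command above can be wrapped in a small shell loop. This is a minimal sketch and not part of this repository: the `whisperx_cmd` helper and the `AUDIO_DIR` variable are hypothetical names, and it assumes one container run per file. It drops `-it` because no terminal is attached when the commands are piped, and it only prints the commands rather than running them.

```shell
#!/bin/sh
# Hypothetical helper (not part of the image): print the CPU docker command
# for one recording. $1 = host directory, $2 = file name inside that directory.
whisperx_cmd() {
    printf 'docker run --rm -v "%s":/app thomasvvugt/whisperx:cpu "%s" --batch_size 4 --compute_type int8\n' "$1" "$2"
}

# Assumed variable: host directory that holds the audio files.
AUDIO_DIR="${AUDIO_DIR:-/path/to/audio_files}"

# Emit one command per .mp3 file; nothing is executed here.
for f in "$AUDIO_DIR"/*.mp3; do
    [ -e "$f" ] || continue   # skip when the glob matches nothing
    whisperx_cmd "$AUDIO_DIR" "$(basename "$f")"
done
```

Saved as, say, `batch.sh`, the output can be reviewed first and then executed with `sh batch.sh | sh`. File names containing double quotes are not handled by this sketch.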

Nvidia CUDA 11.8

Note: This image requires an NVIDIA GPU on the host, but it performs much better than the CPU-only image.

docker run -it --rm --gpus all -v /path/to/audio_files:/app thomasvvugt/whisperx:cuda118 recording.mp3 --batch_size 8 --diarize --hf_token YOUR_HUGGINGFACE_READ_TOKEN

Building the image

You can either use the pre-built images from Docker Hub or build the images yourself. Run the build from the repository root; the trailing `.` is the build context:

docker build -f Dockerfile.cpu -t whisperx:cpu .
docker build -f Dockerfile.cuda118 -t whisperx:cuda118 .

About

Docker image for WhisperX by Max Bain
