Web UI for WhisperSpeech (https://github.com/collabora/WhisperSpeech)
Name | Info |
---|---|
CPU | AMD Ryzen 7900X3D (iGPU disabled in BIOS) |
GPU | AMD Radeon 7900XTX |
RAM | 64GB DDR5 6600MHz |
Motherboard | ASRock B650E PG Riptide WiFi (2.10) |
OS | Ubuntu 22.04 |
Kernel | 6.5.0-28-generic |
ROCm | 6.1 |
-
Install Python 3.11
-
Clone repository
-
Mount the repository directory.
-
Create and activate venv
-
For ROCm set HSA_OVERRIDE_GFX_VERSION.
- For the Radeon 7900XTX:
export HSA_OVERRIDE_GFX_VERSION=11.0.0
- Install requirements
- ROCm 5.7:
pip install -r requirements_rocm_5.7.txt
pip install git+https://github.com/ROCmSoftwarePlatform/flash-attention.git@ae7928c5aed53cf6e75cc792baa9126b2abfcf1a
- ROCm 6.0:
pip install -r requirements_rocm_6.0.txt
pip install git+https://github.com/ROCmSoftwarePlatform/flash-attention.git@2554f490101742ccdc56620a938f847f61754be6
- CUDA 11.8 (Tested on Ubuntu 23.10):
pip install -r requrements_cuda_11.8.txt
- CUDA 12.1 (Tested on Ubuntu 23.10):
pip install -r requrements_cuda_12.1.txt
- Run:
python webui.py
- With -h or --help for help:
python webui.py -h