Faster-whisper issue with the latest NVIDIA 55x series drivers #829

pasinduranjula · 2024-05-10T18:22:42Z

Faster-whisper on CUDA doesn't terminate properly with the latest NVIDIA 551 and 552 drivers on Windows. The process hangs for a while and triggers an automatic Microsoft crash submission before terminating. The issue occurs with both CUDA 11 and CUDA 12, but everything works without problems on all previous NVIDIA drivers (e.g. 546).

A minimal script like the following is enough to trigger the issue:

```python
from faster_whisper import WhisperModel

model = WhisperModel("modelpath", device="cuda", compute_type="int8_float16")
```

The issue seems to be triggered when the CUDA memory allocated for the model is released.
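To check whether the hang really happens at process teardown, one option is to run the snippet in a child process with a timeout and compare what it printed against how it exited. The following is only a rough sketch: `run_child`, `REPRO`, and the 60-second default are hypothetical names and values chosen for illustration, and `"modelpath"` is the placeholder from the report above.

```python
import subprocess
import sys
import time

# Hypothetical repro script; "modelpath" is the placeholder used in the report.
REPRO = '''
from faster_whisper import WhisperModel
model = WhisperModel("modelpath", device="cuda", compute_type="int8_float16")
print("model loaded", flush=True)
'''


def _as_text(data):
    """Normalize captured output, which may be None, bytes, or str."""
    if data is None:
        return ""
    if isinstance(data, bytes):
        return data.decode(errors="replace")
    return data


def run_child(code, timeout_s=60.0):
    """Run `code` in a fresh interpreter and report how it exited."""
    start = time.monotonic()
    try:
        proc = subprocess.run(
            [sys.executable, "-c", code],
            capture_output=True,
            text=True,
            timeout=timeout_s,
        )
        timed_out, returncode, stdout = False, proc.returncode, proc.stdout
    except subprocess.TimeoutExpired as exc:
        # The child was killed after exceeding the timeout.
        timed_out, returncode, stdout = True, None, _as_text(exc.stdout)
    return {
        "timed_out": timed_out,
        "returncode": returncode,
        "stdout": stdout,
        "elapsed_s": time.monotonic() - start,
    }
```

Calling `run_child(REPRO)` on an affected machine and seeing `"model loaded"` in `stdout` together with a timeout or a non-zero `returncode` would be consistent with the failure occurring after the script body has finished, i.e. during cleanup.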

Tested with multiple faster-whisper versions (both CUDA 11 and 12).
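For anyone trying to confirm whether their machine falls into the affected range, a small check like the one below may help. It is a sketch under assumptions: the set of affected branches (551, 552) is taken only from this report, the helper names are hypothetical, and `nvidia-smi` is assumed to be on `PATH`.

```python
import subprocess

# Driver branches reported as affected in this issue; other branches are untested.
REPORTED_AFFECTED = {551, 552}


def driver_major(version):
    """Return the branch number of a driver version string such as '552.22'."""
    return int(version.strip().split(".")[0])


def is_reported_affected(version):
    """True if the driver branch matches one named in this report."""
    return driver_major(version) in REPORTED_AFFECTED


def query_driver_version():
    """Ask nvidia-smi for the installed driver version (first GPU only)."""
    out = subprocess.run(
        ["nvidia-smi", "--query-gpu=driver_version", "--format=csv,noheader"],
        capture_output=True,
        text=True,
        check=True,
    ).stdout
    return out.splitlines()[0].strip()
```

For example, `is_reported_affected(query_driver_version())` gives a quick yes/no on a machine with an NVIDIA GPU; a `False` result only means the branch is not one of those named here, not that the driver is known to be safe.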

pasinduranjula · 2024-05-12T12:56:36Z

The issue appears to originate in the CTranslate2 library: some CUDA API calls made in its destructors seem to cause the hang. https://github.com/OpenNMT/CTranslate2/blob/v4.2.1/src/cuda/utils.cc
