Faster-whisper issue with the latest NVIDIA 55x series drivers #829

Open
pasinduranjula opened this issue May 10, 2024 · 1 comment

Faster-whisper on CUDA doesn't terminate properly with the latest NVIDIA 551 and 552 drivers on Windows. The process hangs for a while and triggers an automatic Microsoft crash submission before terminating. The issue occurs with both CUDA 11 and 12, but everything works without issues on all previous NVIDIA drivers (e.g. 546).

A minimal snippet like the following can trigger the issue:

```python
from faster_whisper import WhisperModel

model = WhisperModel("modelpath", device="cuda", compute_type="int8_float16")
```

The issue seems to be triggered when the CUDA memory allocated for the model is released.
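One way to check whether the release step itself is slow is to drop the model explicitly and time the teardown, instead of leaving it to interpreter exit. The following is only a rough sketch: `time_release` and `load_model` are hypothetical helper names, not part of faster-whisper, and it assumes that dropping the last Python reference is what frees the GPU memory.

```python
import gc
import time


def time_release(factory):
    """Create an object, drop the only reference, and time the teardown."""
    obj = factory()
    start = time.perf_counter()
    del obj       # drop the last reference; CPython runs the destructor here
    gc.collect()  # also collect anything kept alive by reference cycles
    return time.perf_counter() - start


def load_model():
    # Hypothetical loader mirroring the snippet above; imported lazily so
    # time_release() can be used without faster-whisper installed.
    from faster_whisper import WhisperModel

    return WhisperModel("modelpath", device="cuda", compute_type="int8_float16")
```

Calling `print(time_release(load_model))` on an affected machine would show how long the explicit release takes. If it returns quickly and the process still hangs when exiting, the problem more likely sits in cleanup that runs at process shutdown than in freeing the model itself.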

Tested with multiple faster-whisper versions (both CUDA 11 and 12).

pasinduranjula commented May 12, 2024

The issue seems to be in the CTranslate2 library: some of its destructors make CUDA API calls, and those calls appear to cause the hang. See https://github.com/OpenNMT/CTranslate2/blob/v4.2.1/src/cuda/utils.cc
