POC: DO NOT MERGE (speech-to-text) - Whisper #71

rickstaa · 2024-04-27T07:30:48Z

This pull request contains a quick proof of concept to showcase how easy it is to add a new pipeline. I added the speech-to-text pipeline in this example using openai/whisper-large-v3. The beutifull thing about this pipeline is that it can be cold served since model load times are < 3s. Further it only requires 6.5 GB Vram and therefore can be done on lower VRam cards.

You can test it out using audio samples from https://audio-samples.github.io/ and starting up a pipeline using the Runner documentation. You can then execute the pipeline running on https://localhost:8000/docs.

Warning

NOT PRODUCTION READY DO NOT MERGE.

This commit contains a quick proof of concept to showcase how easy it is to add a new pipeline.

POC: DO NOT MERGE (speech-to-text)

d3fac30

This commit contains a quick proof of concept to showcase how easy it is to add a new pipeline.

rickstaa marked this pull request as draft April 27, 2024 08:20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

POC: DO NOT MERGE (speech-to-text) - Whisper #71

POC: DO NOT MERGE (speech-to-text) - Whisper #71

rickstaa commented Apr 27, 2024 •

edited

POC: DO NOT MERGE (speech-to-text) - Whisper #71

Are you sure you want to change the base?

POC: DO NOT MERGE (speech-to-text) - Whisper #71

Conversation

rickstaa commented Apr 27, 2024 • edited

rickstaa commented Apr 27, 2024 •

edited