Skip to content

How to stream diarization and transcription (Whisper) results from an audio file? #211

Answered by juanmc2005
ColtonBehannon asked this question in Q&A
Discussion options

You must be logged in to vote

Hi @ColtonBehannon, if you already have streaming transcriptions (with timestamps), adding diarization would be a matter of running both in parallel and then aligning the output according to both sets of timestamps. I recently implemented a now outdated SpeakerAwareTranscription pipeline (see #147), but it's a bit hacky and should be improved before integrating that into the library.

Concerning the blog post, the same principles apply to the latest version of diart, only some names need to be updated. Otherwise you can simply install a previous version like v0.6 or v0.7.

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@ColtonBehannon
Comment options

Answer selected by ColtonBehannon
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
question Further information is requested
2 participants