When testing in Speech Studio I see exactly the behavior I am trying to get, as shown in the picture: speech into the microphone is transcribed continuously, and chunks of text are diarized in real time.
I followed the quickstart guide on real-time diarization, but when I run it I only get the "TRANSCRIBED" logs after the whole audio has been processed, with no intermediate results. If I add a listener on .transcribing, text is logged as it is processed, but it is not attributed to a speaker (speakerId is undefined). See the image from my terminal: all the "Transcribing" events arrive first, and the "Transcribed" events arrive only at the end of the whole audio.
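For reference, here is a minimal sketch of the listener setup described above. It assumes the handlers are assigned as `(sender, event)` callbacks on the transcriber and that the text and speaker are read from `event.result.text` and `event.result.speakerId`; `formatResult` and `attachLogging` are hypothetical helper names, and a plain object stands in for the real transcriber so the snippet runs without credentials.

```javascript
// Sketch only: formatResult and attachLogging are illustrative names,
// not part of the Speech SDK.

// Build one log line; make a missing speakerId explicit instead of printing "undefined".
function formatResult(kind, result) {
  const speaker = result.speakerId === undefined ? "<no speakerId>" : result.speakerId;
  return `${kind} [${speaker}]: ${result.text}`;
}

// Assign both handlers on a transcriber-like object.
// Assumption: the handlers receive (sender, event) and the event exposes event.result.
function attachLogging(transcriber, log = console.log) {
  transcriber.transcribing = (_sender, e) => log(formatResult("TRANSCRIBING", e.result));
  transcriber.transcribed = (_sender, e) => log(formatResult("TRANSCRIBED", e.result));
  return transcriber;
}

// Stand-in for the real transcriber: fire one interim and one final event by hand.
const fakeTranscriber = attachLogging({});
fakeTranscriber.transcribing(null, { result: { text: "hello there", speakerId: undefined } });
fakeTranscriber.transcribed(null, { result: { text: "Hello there.", speakerId: "Guest-1" } });
```

With the actual SDK, the same helper would be applied to the `ConversationTranscriber` instance before calling `startTranscribingAsync()`. In my runs the interim events are the ones that arrive with no speakerId.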
What I would like to see instead is the behavior shown in this announcement of real-time diarization, where the transcribing events are interleaved with transcribed events as soon as an utterance is identified.
I have searched the documentation and samples here but have been unable to find anything on this. I am not sure whether this is a bug or whether I am missing some crucial piece of information. I would greatly appreciate any help! (I am using JavaScript.)