Replies: 1 comment 2 replies
Did you read this? This should answer most of your questions about this model.
pyannote speaker diarization is based on the following segmentation model:
End-to-end speaker segmentation for overlap-aware resegmentation
In the above paper, under "Implementation details", they wrote:
[i.e., 5s audio chunks with a sampling rate of 16kHz]
K_max-dimensional speaker activations between 0 and 1 every 16ms.
How can I tell from each output whether a new segment has started, how many segments there are, and how many speakers are present? An example would be very helpful.
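To make the question concrete, here is a rough sketch of one way such an output could be read. It is not pyannote's own code. It assumes the output for one chunk is a NumPy array of shape (num_frames, K_max), that a speaker slot counts as active when its value exceeds a threshold of 0.5 (an assumed value, normally tuned), and that frames are 16ms apart as quoted above. The function name `activations_to_segments` and the toy array are hypothetical. Speaker slots are local to one chunk, so matching speakers across chunks needs a separate step that this sketch does not cover.

```python
import numpy as np

FRAME_STEP = 0.016  # seconds between output frames, as quoted from the paper
THRESHOLD = 0.5     # assumed binarization threshold; tuned in practice


def activations_to_segments(activations, threshold=THRESHOLD, frame_step=FRAME_STEP):
    """Turn a (num_frames, k_max) activation array into per-speaker segments.

    Returns {slot_index: [(start_sec, end_sec), ...]} for every speaker slot
    that is active at least once in the chunk.
    """
    active = activations > threshold  # boolean array, (num_frames, k_max)
    segments = {}
    for k in range(active.shape[1]):
        # Pad with zeros so every active run has a rising and a falling edge.
        padded = np.concatenate(([0], active[:, k].astype(int), [0]))
        edges = np.diff(padded)
        starts = np.flatnonzero(edges == 1)   # first active frame of each run
        ends = np.flatnonzero(edges == -1)    # first inactive frame after each run
        if len(starts):
            segments[k] = [
                (float(s * frame_step), float(e * frame_step))
                for s, e in zip(starts, ends)
            ]
    return segments


# Hypothetical output for one chunk: 10 frames, k_max = 3 speaker slots.
activations = np.zeros((10, 3))
activations[0:4, 0] = 0.9   # slot 0 active on frames 0-3
activations[2:7, 1] = 0.8   # slot 1 active on frames 2-6, overlapping slot 0
activations[8:10, 0] = 0.7  # slot 0 speaks again on frames 8-9

segments = activations_to_segments(activations)
num_speakers = len(segments)                             # slots active in this chunk
num_segments = sum(len(v) for v in segments.values())    # total active runs
overlap_frames = int(((activations > THRESHOLD).sum(axis=1) >= 2).sum())
```

In this toy chunk, two slots are active, there are three segments in total (two for slot 0, one for slot 1), and two frames contain overlapping speech. A new segment starts wherever a slot's activation crosses the threshold upward.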