Dealing with overlapping speech #1157

Answered by cetiny
bfavero asked this question in Q&A
I have a solution that works 95% of the time by post-processing the diarization output with pandas. My goal is to have zero overlaps in the final dataframe.

  • First, distinguish between full overlaps (like the first two in your example) and partial overlaps (like the last one in your example, at the end of the first segment).
  • General: Delete all segments shorter than 0.5 seconds (mostly "hmm" and short "yes" while the other speaker is speaking).
  • Full overlap, short segments: Delete overlapped segments shorter than 1 second (mostly a speaker starting too soon and not continuing before the first speaker finishes).
  • Full overlap, longer segments: I split the longer (enclosing) segment in two; the shorter segment interrupts it and overwrites that stretch.
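
The rules above could be sketched roughly as follows. This is only an illustration under stated assumptions, not the exact code from the answer: it uses plain Python tuples rather than a dataframe so that it runs without dependencies, the names `Segment`, `MIN_DURATION`, `MIN_FULL_OVERLAP`, and `resolve_full_overlaps` are hypothetical, and partial overlaps are left untouched because the answer does not say how they are resolved.

```python
from typing import List, NamedTuple


class Segment(NamedTuple):
    """Hypothetical container for one diarization turn (times in seconds)."""
    start: float
    end: float
    speaker: str


MIN_DURATION = 0.5      # general rule: drop anything shorter than this
MIN_FULL_OVERLAP = 1.0  # full-overlap rule: drop contained segments shorter than this


def resolve_full_overlaps(segments: List[Segment]) -> List[Segment]:
    # General rule: remove very short segments ("hmm", short "yes").
    kept = [s for s in segments if s.end - s.start >= MIN_DURATION]
    # Sort by start; for equal starts, put the longer segment first.
    kept.sort(key=lambda s: (s.start, -s.end))

    result: List[Segment] = []
    for seg in kept:
        # Look for an already accepted segment that fully contains this one.
        idx = next(
            (i for i in range(len(result) - 1, -1, -1)
             if result[i].start <= seg.start and seg.end <= result[i].end),
            None,
        )
        if idx is None:
            # No full overlap. Partial overlaps are NOT handled in this sketch.
            result.append(seg)
            continue
        if seg.end - seg.start < MIN_FULL_OVERLAP:
            # Full overlap, short segment: drop it.
            continue
        # Full overlap, longer segment: split the enclosing segment around it.
        outer = result[idx]
        pieces = [
            Segment(outer.start, seg.start, outer.speaker),
            seg,
            Segment(seg.end, outer.end, outer.speaker),
        ]
        # Discard empty pieces (e.g. when both segments start at the same time).
        result[idx:idx + 1] = [p for p in pieces if p.end > p.start]

    return sorted(result, key=lambda s: s.start)


segments = [
    Segment(0.0, 10.0, "A"),
    Segment(2.0, 2.3, "B"),   # shorter than 0.5 s: removed by the general rule
    Segment(3.0, 3.8, "B"),   # inside A and shorter than 1 s: removed
    Segment(5.0, 7.0, "B"),   # inside A and longer than 1 s: splits A in two
    Segment(9.5, 12.0, "B"),  # partial overlap with A: left as-is here
]
cleaned = resolve_full_overlaps(segments)
```

If the result should end up in pandas again, something like `pandas.DataFrame(cleaned, columns=["start", "end", "speaker"])` would turn the list back into a dataframe. The remaining partial overlap in the last segment shows why an extra step is still needed to reach zero overlaps.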

Answer selected by bfavero

This discussion was converted from issue #1154 on November 17, 2022 07:55.