Duplicate words #811

vincaslt · 2024-04-26T18:40:22Z

I'm transcribing a relatively long video and I'm often get a bunch of duplicated words for the same timestamp, e.g.:

 talk about the frameworks that entrepreneurs can use to think about how the ad value, and how it's the balance balance balance balance balance balance balance balance balance balance balance balance. what most entrepreneurs do wrong.

I use distil-large-v2 model with faster-whisper standalone executable. Here are the arguments I'm passing into faster-whisper.

  const whisperArgs = [
    `"${audioFilePath}"`,
    "--beep_off",
    "--model",
    "distil-large-v2",
    "--language",
    "en",
    "--word_timestamps",
    "True",
    "--output_format",
    "json",
    "--output_dir",
    `"${opts.resourceManager.tempDir}"`,
    "--model_dir",
    `"${opts.resourceManager.appDataDir}"`,
    "--beam_size",
    "1",
    "--one_word",
    "2",
    "--verbose",
    opts.verbose ? "True" : "False",
    opts.verbose ? "" : "--print_progress",
  ].filter(Boolean);

I saw a relevant discussion, but it proposed a fix already, which did not fix the issue for me: #716

I made sure I'm on the latest version as of today. I also tried playing around with beam_size setting, but no effect, just slower transcription. I need the one_word setting, though it might be causing the issue, but haven't tested yet (might test it later). The video I'm testing with is this one: https://www.youtube.com/watch?v=q3xN1iYeTNI (downloaded with youtube-dl)

The text was updated successfully, but these errors were encountered:

Purfview · 2024-04-26T19:52:52Z

I use distil-large-v2 model with faster-whisper standalone executable.

Then you are posting in the wrong repo.
Try standard model, medium or large-v2, or --hallucination_silence_threshold 2.
Imo, the distil models are not good for the long form transcriptions.

I need the one_word setting, though it might be causing the issue

It can't cause any issue as it's just srt/vtt writing setting and it has no effect in your example as output there is json.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Duplicate words #811

Duplicate words #811

vincaslt commented Apr 26, 2024

Purfview commented Apr 26, 2024

Duplicate words #811

Duplicate words #811

Comments

vincaslt commented Apr 26, 2024

Purfview commented Apr 26, 2024