Inconsistent hearing impaired/non-hearing impaired flagging #2386

lawadr · 2024-02-11T03:34:51Z

Describe the bug
When a subtitle is downloaded that is tagged as non-hearing impaired, but is scanned as hearing impaired, it is flagged as hearing impaired in Bazarr while saved without the "sdh" or "hi" in the file name. On rescan, it is re-flagged as non-hearing impaired in Bazarr.

To Reproduce

Find a subtitle that is flagged by the provider as non-hearing impaired but is detected as hearing impaired by Bazarr
Download the subtitle via a search or manual download
Notice that Bazarr detects it as <LANGUAGE> HI
Click "Scan Disk" and notice it getting changed to <LANGUAGE>

Expected behavior
Either it gets flagged as <LANGUAGE> and stays as <LANGUAGE> after a re-scan, or it gets saved with "sdh" or "hi" in the filename and continuously gets detected as <LANGUAGE> HI every re-scan.

Software (please complete the following information):

Bazarr: v1.4.1

The text was updated successfully, but these errors were encountered:

lawadr · 2024-02-11T03:43:50Z

It looks like the subtitle is saved to file based on what the provider thinks the language is. If it thinks it's English but not hearing impaired it'll get saved as *.en.srt for example

After that, guess_external_subtitles is called which reads the file and finds it to be hearing impaired. It is then saved to the database as being hearing impaired.

On re-scan, the language of an external subtitle is initially set based on its filename, which in this case is English non-hearing impaired as it's still just *.en.srt.

When guess_external_subtitles is called again, it does not read the file to change the language to hearing impaired because of the previously_indexed_subtitles_to_exclude argument. This makes it forever treated as non-hearing impaired.

My questions here are:

Should the provider be trusted and the file always treated as non-hearing impaired? Or should the file scan during guess_external_subtitles be consistent from then on and it always being treated as hearing impaired?
Should the result of guess_external_subtitles change the filename to match what it thinks it is?

morpheus65535 · 2024-02-11T04:11:45Z

Yeah, I've been aware of that for a while but, probably out of laziness, I never fixed it.

I think that, while downloading a subtitles file, we should always validate if it's HI or not. The file should be named accordingly, not using the language known by the provider. guess_external_subtitles should then return the same value on each execution. Make sense?

…constants. #2386

morpheus65535 · 2024-05-03T02:09:14Z

This should be fixed in upcoming beta. Keep me informed if it's as expected.

morpheus65535 self-assigned this Mar 8, 2024

morpheus65535 added a commit that referenced this issue May 3, 2024

Fixed HI subtitles identification when downloading and improved some …

2c4ed03

…constants. #2386

morpheus65535 added bug fixed labels May 3, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Inconsistent hearing impaired/non-hearing impaired flagging #2386

Inconsistent hearing impaired/non-hearing impaired flagging #2386

lawadr commented Feb 11, 2024

lawadr commented Feb 11, 2024 •

edited

morpheus65535 commented Feb 11, 2024

morpheus65535 commented May 3, 2024

Inconsistent hearing impaired/non-hearing impaired flagging #2386

Inconsistent hearing impaired/non-hearing impaired flagging #2386

Comments

lawadr commented Feb 11, 2024

lawadr commented Feb 11, 2024 • edited

morpheus65535 commented Feb 11, 2024

morpheus65535 commented May 3, 2024

lawadr commented Feb 11, 2024 •

edited