Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Normalize confidence to the length of the track #72

Open
denis-stepanov opened this issue Apr 13, 2024 · 1 comment
Open

Normalize confidence to the length of the track #72

denis-stepanov opened this issue Apr 13, 2024 · 1 comment
Labels
enhancement New feature or request

Comments

@denis-stepanov
Copy link
Owner

Apparently, 'fingerprinted_confidence' of DejaVu is proportional to the length of the track. It could only reach 100% if the entire track fits the recognition window. It means that tracks which do not fit the window might be treated unequally with regard to their duration. E.g., for a listening window of 2 seconds:

  1. jingle of 3 seconds yielding 10% confidence --> should read 3 / 2 * 10% = 15%
  2. jingle of 6 seconds yielding 10% confidence --> should read 6 / 2 * 10% = 30%

The confidence in the result in case 1 is actually higher, because DejaVu had listened for 2/3 of the track and only for 1/3 in case 2. So, for the same "reported" confidence within a fixed window it should result in a lower number for a "real" confidence. In other words, the result should be weighted, i.e., multiplied on a coefficient "length of the track" / "window length".

Note 1: this only makes sense for windows lengths where DejaVu demonstrates reliable detection, i.e., for 2+ seconds. Going below 2 seconds would likely result in noise amplification and in many false positives.

Note 2: DejaVu does not record in its database track length required for this calculation. However, it could be inferred with reasonable precision (circa 0.1%) from fingerprint offsets. db-djv-pg tool is already doing this calculation for the purpose of statistics.

@denis-stepanov denis-stepanov added the enhancement New feature or request label Apr 13, 2024
@denis-stepanov
Copy link
Owner Author

Depends on #46 study which should be done first.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant