TRUTH.TOTAL differs despite having used the same TRUTH SET #165

robertzeibich · 2022-10-20T00:38:40Z

Do you know why the TRUTH.TOTAL differs despite having used the same TRUTH SET?

shinlin77 · 2023-04-03T19:17:52Z

Yes, I have that same question.

opplatek · 2024-03-13T13:59:28Z

Same here. I tried:

Run full GiaB HG002 (AshkenazimTrio/HG002_NA24385_son/NISTv4.2.1/GRCh38) as both the truth and the sample
Run full GiaB HG002 (AshkenazimTrio/HG002_NA24385_son/NISTv4.2.1/GRCh38) as the truth and HG002 subset (1000 variants) as the sample.

I got a different TRUTH.TOTAL counts in the summary. However, the number of annotated variants in hap.py annotated output VCF is the same and is equal to the second test (full sample as the truth and subset as the sample).

However, it seems to apply only for vcfeval engine. With xcmp it seems to report the same numbers.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TRUTH.TOTAL differs despite having used the same TRUTH SET #165

TRUTH.TOTAL differs despite having used the same TRUTH SET #165

robertzeibich commented Oct 20, 2022 •

edited

shinlin77 commented Apr 3, 2023

opplatek commented Mar 13, 2024 •

edited

TRUTH.TOTAL differs despite having used the same TRUTH SET #165

TRUTH.TOTAL differs despite having used the same TRUTH SET #165

Comments

robertzeibich commented Oct 20, 2022 • edited

shinlin77 commented Apr 3, 2023

opplatek commented Mar 13, 2024 • edited

robertzeibich commented Oct 20, 2022 •

edited

opplatek commented Mar 13, 2024 •

edited