The same input bam, fasta and models, but different output size of merge_output.vcf.gz #303

Jerry-is-a-mouse · 2024-04-26T23:39:56Z

Hi, I used Clair3 (v1.0.5) to call variants on HG002 PacBio HiFi reads (15-20 kb, chemistry 2). I ran the run_clair3.sh command twice from the command line with the same inputs, but the two resulting merge_output.vcf.gz files have different sizes. Is this expected? In other words, is this difference a consequence of the algorithm Clair3 uses?

aquaskyline · 2024-04-27T08:38:07Z

Could you please look into the two VCFs and see what the differences are?
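
One way to look into the differences, as a minimal sketch rather than anything Clair3 provides: load each VCF and compare the sets of (CHROM, POS, REF, ALT) records. The function names `load_variant_keys` and `compare_vcfs` are hypothetical, and the sketch assumes both files fit in memory and that comparing on those four columns is enough for a first pass (it ignores QUAL, FILTER, and genotype differences).

```python
import gzip


def load_variant_keys(path):
    """Return the set of (CHROM, POS, REF, ALT) tuples found in a VCF.

    Works on plain-text and gzip/bgzip-compressed files (chosen by extension).
    """
    opener = gzip.open if str(path).endswith(".gz") else open
    keys = set()
    with opener(path, "rt") as handle:
        for line in handle:
            # Skip header lines and blank lines.
            if line.startswith("#") or not line.strip():
                continue
            # Only the first five tab-separated columns are needed.
            chrom, pos, _id, ref, alt = line.split("\t", 5)[:5]
            keys.add((chrom, int(pos), ref, alt))
    return keys


def compare_vcfs(path_a, path_b):
    """Split the records of two VCFs into shared, only-in-A, and only-in-B."""
    a = load_variant_keys(path_a)
    b = load_variant_keys(path_b)
    return {"shared": a & b, "only_a": a - b, "only_b": b - a}
```

Calling `compare_vcfs("merge_output.vcf.gz", "HG002_Nanopore.vcf.gz")` and printing the size of each set, plus a few sorted entries from `only_a` and `only_b`, would show whether the extra records cluster in particular regions or are spread across the genome.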

Jerry-is-a-mouse · 2024-04-27T09:59:35Z

@aquaskyline I counted the variants called in each file using the wc command, as follows:
(1) The vcf.gz I got yesterday:
less merge_output.vcf.gz | grep -v "^#" | wc -l
4443956
(2) The vcf.gz I got about 2 months ago:
less HG002_Nanopore.vcf.gz | grep -v "^#" | wc -l
4527382
I am sorry, but the files are too big to upload.

aquaskyline · 2024-05-07T08:12:13Z

One of your files is named Nanopore, but you said you were using the same PacBio HiFi input for both runs?

Jerry-is-a-mouse · 2024-05-08T06:36:23Z

Sorry, what I used is Nanopore sequencing data. I re-ran both types of data and found that the PacBio HiFi results are identical between runs, but the Nanopore results differ.

aquaskyline · 2024-05-08T07:18:19Z

Outputs of Clair3 are deterministic. You might want to try again using the same version, model, and parameters.
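
Since the two runs were about two months apart, one way to check whether the version, model, or parameters changed is to compare the meta-information lines ("##" lines) at the top of each VCF. The sketch below is only an illustration: `read_header_lines` and `diff_headers` are hypothetical helper names, and it is an assumption that the headers of these particular files record the version or command line, so the output needs to be inspected by eye.

```python
import gzip


def read_header_lines(path):
    """Return the leading '##' meta-information lines of a VCF, in order."""
    opener = gzip.open if str(path).endswith(".gz") else open
    header = []
    with opener(path, "rt") as handle:
        for line in handle:
            # Meta lines come first; stop at '#CHROM' or the first record.
            if not line.startswith("##"):
                break
            header.append(line.rstrip("\n"))
    return header


def diff_headers(path_a, path_b):
    """Return (lines only in A, lines only in B), preserving file order."""
    a = read_header_lines(path_a)
    b = read_header_lines(path_b)
    set_a, set_b = set(a), set(b)
    only_a = [line for line in a if line not in set_b]
    only_b = [line for line in b if line not in set_a]
    return only_a, only_b
```

If both returned lists are empty, the headers give no hint of a configuration change, and the comparison has to fall back on the run logs and the exact commands used for each run.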
