New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
vt cannot retrieve sequences from my reference sequence file #84
Comments
AL123456 is not a chromosome in hg19.fasta I believe. You should use the reference that was used to generate your VCF file. |
After downloading the reference genome from ncbi genome still getting the same error
|
GRCh37_latest_genomic.fna.gz has its chromosomes not named consistently with the chromosomes in your VCF file. They are prepended with NC_, NW_ and NT_. You need to use a reference file where the sequences have a name consistent with what is found in your VCF file. |
@atks I had a similar issue. Below is is my error When I used the -n flag vt skipped several variants |
Deleted the reference index file.
Downloaded reference genome from the UCSC database.
But still getting the same error.
vt normalize HF.final.vcf.gz -r ref/hg19.fasta -o normalizeHF.final.vcf.gz
[variant_manip.cpp:72 is_not_ref_consistent] failure to extract base from fasta file: AL123456:24697-24715
FAQ: http://genome.sph.umich.edu/wiki/Vt#1._vt_cannot_retrieve_sequences_from_my_reference_sequence_file
The text was updated successfully, but these errors were encountered: