Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

vt cannot retrieve sequences from my reference sequence file #84

Open
alrafaykhan opened this issue Mar 21, 2018 · 4 comments
Open

vt cannot retrieve sequences from my reference sequence file #84

alrafaykhan opened this issue Mar 21, 2018 · 4 comments

Comments

@alrafaykhan
Copy link

Deleted the reference index file.
Downloaded reference genome from the UCSC database.
But still getting the same error.
vt normalize HF.final.vcf.gz -r ref/hg19.fasta -o normalizeHF.final.vcf.gz

[variant_manip.cpp:72 is_not_ref_consistent] failure to extract base from fasta file: AL123456:24697-24715
FAQ: http://genome.sph.umich.edu/wiki/Vt#1._vt_cannot_retrieve_sequences_from_my_reference_sequence_file

@atks
Copy link
Owner

atks commented Mar 21, 2018

AL123456 is not a chromosome in hg19.fasta I believe. You should use the reference that was used to generate your VCF file.

@alrafaykhan
Copy link
Author

After downloading the reference genome from ncbi genome still getting the same error
wget ftp://ftp.ncbi.nlm.nih.gov/refseq/H_sapiens/annotation/GRCh37_latest/refseq_identifiers/GRCh37_latest_genomic.fna.gz

vt normalize HF.final.vcf.gz -r ref/GRCh37_latest_genomic.fna.gz -o normalizeHF.final.vcf.gz

[variant_manip.cpp:72 is_not_ref_consistent] failure to extract base from fasta file: 1:866510-866510
FAQ: http://genome.sph.umich.edu/wiki/Vt#1._vt_cannot_retrieve_sequences_from_my_reference_sequence_file

@atks
Copy link
Owner

atks commented Mar 23, 2018

@alrafaykhan

GRCh37_latest_genomic.fna.gz has its chromosomes not named consistently with the chromosomes in your VCF file. They are prepended with NC_, NW_ and NT_.

You need to use a reference file where the sequences have a name consistent with what is found in your VCF file.

@vappiah
Copy link

vappiah commented Jun 11, 2020

@atks I had a similar issue. Below is is my error
[variant_manip.cpp:96 is_not_ref_consistent] reference bases not consistent: chr10:4868478-4868486 TGCGGGGCG(REF) vs Tgcggggcg(FASTA)

When I used the -n flag vt skipped several variants

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants