Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

NCBI Database Download Error #313

Open
EmmaMills101 opened this issue Mar 15, 2021 · 4 comments
Open

NCBI Database Download Error #313

EmmaMills101 opened this issue Mar 15, 2021 · 4 comments

Comments

@EmmaMills101
Copy link

Im trying to download the NCBI reference database and keep getting this error. I did force Biopython 1.76 to deal with the BioAlphabet issue, but that is the only modification I made. Thanks in advance

Processing record 392 of 5966 (accession NG_048387.1)
'AP018746.1'
gb_feature.qualifer not found
Traceback (most recent call last):
File "/Users/emma/anaconda3/envs/ariba/bin/ariba", line 312, in
args.func(args)
File "/Users/emma/anaconda3/envs/ariba/lib/python3.6/site-packages/ariba/tasks/getref.py", line 11, in run
getter.run(options.outprefix)
File "/Users/emma/anaconda3/envs/ariba/lib/python3.6/site-packages/ariba/ref_genes_getter.py", line 664, in run
exec('self.get_from' + self.ref_db + '(outprefix)')
File "", line 1, in
File "/Users/emma/anaconda3/envs/ariba/lib/python3.6/site-packages/ariba/ref_genes_getter.py", line 650, in _get_from_ncbi
id=f"{id[0]}.{accession}",
TypeError: 'NoneType' object is not subscriptable

@learithe
Copy link

I am also getting this error.

@bajwamoneeb
Copy link

Me too..

@rpetit3
Copy link
Contributor

rpetit3 commented Jul 9, 2021

Looking into that sequence, the issue is the GenBank file has "locus_tag" in the "note" field. I think this might be an issue to bring up with NCBI as well.

     CDS             101..820
                     /gene="repE"
                     /inference="COORDINATES: ab initio
                     prediction:Prodigal:2.6.3"
                     /inference="similar to AA sequence:UniProtKB:P62539"
     here------->    /note="locus_tag:MRY14229_p30010"
                     /codon_start=1
                     /transl_table=11
                     /product="replication initiation protein"
                     /protein_id="BBE81316.1"
                     /translation="MDKLLNKKIKVKQSNELTEAAYYLSLKAKRVLWLCLMQTYFTAS
                     VSEDDDEMAVLGDSTFKVKVADYEQIFQVSRNQAIKDVKEGVFELSRSAVIFYPKEGS
                     FDCVARPWLTEAGSRSARGIWEIEFNHKLLRYIYGLTNQFTTYSLRDCGSLRNPRTIR
                     LYESLAQFKSSGLWVTTHAWLNDRFLLPESQQKNLAELKRSFLDPALKQINEKTPLLA
                     KYSIDDSGKFLFSIIDKQNPV"

@cpeeters
Copy link

I have the same issue when trying to download latest version of ncbi reference db, but now on other accession=

Processing record 795 of 6243 (accession NG_048402.1)
'AP018746.1'
gb_feature.qualifer not found
Traceback (most recent call last):
  File "/usr/local/bin/ariba", line 312, in <module>
    args.func(args)
  File "/home/cpeeters/.local/lib/python3.7/site-packages/ariba/tasks/getref.py", line 11, in run
    getter.run(options.outprefix)
  File "/home/cpeeters/.local/lib/python3.7/site-packages/ariba/ref_genes_getter.py", line 664, in run
    exec('self._get_from_' + self.ref_db + '(outprefix)')
  File "<string>", line 1, in <module>
  File "/home/cpeeters/.local/lib/python3.7/site-packages/ariba/ref_genes_getter.py", line 650, in _get_from_ncbi
    id=f"{id[0]}.{accession}",
TypeError: 'NoneType' object is not subscriptable

Someone knows how to fix this (either in ariba, or in the source genbank entry)?

I tried ARIBA on a batch of genomes using CARD and really love the results, but I would need NCBI reference DB instead for this dataset, so any help is much appreciated!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants