Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

EMAL Blast genome database ID formatting issue #4

Open
sib0 opened this issue Oct 9, 2017 · 3 comments
Open

EMAL Blast genome database ID formatting issue #4

sib0 opened this issue Oct 9, 2017 · 3 comments

Comments

@sib0
Copy link

sib0 commented Oct 9, 2017

Hello,

I encountered an issue while trying to set up ezmap to perform some tests.
I did not use your setup to get around sudo restrictions of my workstation and tried to run the pipeline in desktop mode on the sample fastq you're providing but EMAL returned the following error:

"Staring Step 5 - EMAL PREP\n\n
Determining Genebank ID and Calculating Genome Lengths.....
Traceback (most recent call last):
File "/ezmap/tools/EMAL/EMAL-DataPrep.py", line 176, in
getNucleotideIDs(blastGenomeDBPath)
File "/ezmap/tools/EMAL/EMAL-DataPrep.py", line 37, in getNucleotideIDs
nucleotideID = line.split('|')[1]
IndexError: list index out of range"

It seems to expect fasta headers informations delimited by "|" in the blast genome database which isn't the case in the viral.1.1.genomic.fna file indicated in your wiki. A similar issue is returned during the EMAL Main step. Thank you for your assistance.

@rblawid
Copy link

rblawid commented Oct 19, 2017

Hi,

I do believe I have a similar error as sib0,

"File "/home/rb/EzMap/ezmap-master/tools/EMAL/EMAL-Main.py", line 400, in
information, genomeTaxon, totalNumberOfReads, blastHits = gatherInformation(BLASTFileArray)
File "/home/rb/EzMap/ezmap-master/tools/EMAL/EMAL-Main.py", line 157, in gatherInformation
genomeID = int(linePar[1].split('|')[1])
IndexError: list index out of range"

Thanks a lot for your help,

@LCarioti
Copy link

same problem

"Staring Step 5 - EMAL PREP\n\n
Determining Genebank ID and Calculating Genome Lengths.....
Traceback (most recent call last):
File "/home/v/EzMap/tools/EMAL/EMAL-DataPrep.py", line 176, in
getNucleotideIDs(blastGenomeDBPath)
File "/home/v/EzMap/tools/EMAL/EMAL-DataPrep.py", line 37, in getNucleotideIDs
nucleotideID = line.split('|')[1]
IndexError: list index out of range
Staring Step 6 - EMAL MAIN\n\n

EMAL 0.2b

Number of Threads: 16
Acceptance Value: 0.0001
Grabbing list of files to process from:
/home/v/EzMap/Output1/4-OrganismMapping/
1 files found to process:
/home/v/EzMap/Output1/4-OrganismMapping/SRR102448-PreSelectedReadsExample100KReads-blastn.tsv
Preparing to run...
Traceback (most recent call last):
File "/home/v/EzMap/tools/EMAL/EMAL-Main.py", line 400, in
information, genomeTaxon, totalNumberOfReads, blastHits = gatherInformation(BLASTFileArray)
File "/home/v/EzMap/tools/EMAL/EMAL-Main.py", line 157, in gatherInformation
genomeID = int(linePar[1].split('|')[1])
IndexError: list index out of range
Staring Step 7 - EMAL POST\n\n
Reading in information...
Traceback (most recent call last):
File "/home/v/EzMap/tools/EMAL/EMAL-Post.py", line 200, in
processCSV(inputFile)
File "/home/v/EzMap/tools/EMAL/EMAL-Post.py", line 72, in processCSV
with open(file, 'r') as f:
FileNotFoundError: [Errno 2] No such file or directory: '/home/v/EzMap/Output1/5-RelativeAbundanceEstimation/EzMapTest.gra'
Gathering Step 1 Results...
Gathering Step 2 Results...
Gathering Step 3 Results...
Gathering Step 4 Results...
Gathering Step 5 Results...
Traceback (most recent call last):
File "/home/v/EzMap/Scripts/gatherResults.py", line 422, in
taxonomyLevels = emalInfo[0]
IndexError: list index out of range"

@kyle-lk
Copy link

kyle-lk commented Dec 23, 2018

also this problem. nobody want to fix this problem. another pipleline just for publish....

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants