Annotation name not found #112

Open
rkyger-git opened this issue Nov 8, 2022 · 11 comments

Comments

@rkyger-git

rkyger-git commented Nov 8, 2022

Hello fairliereese,

I ran into the same issue described in the closed thread #91 when trying to run either talon_filter_transcripts or talon_abundance.
Neither my .db file nor the output files from the previous step ("Running TALON") are empty.
Do you know what might be causing the problem?

Originally posted by @rkyger-git in #91 (comment)

@fairliereese
Member

Could you please provide the calls you made to TALON and the filtering / abundance utilities that caused this error? Thanks!

@rkyger-git
Author

rkyger-git commented Nov 8, 2022

Yes, the code used was:

talon_initialize_database --f A100.gtf --g a100 --a a100_rna --o A100

talon --t 64 --f config.talon --db A100.db --build a100 --o A100_out

talon_filter_transcripts --db A100.db -a a100_rna --o A100_talon_filt_trans

@fairliereese
Member

Sorry about the wait. Could you please run this Python code on your TALON database and tell me what the output is? I'd love to get to the bottom of this bug, which seems to be common:

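import sqlite3

# Placeholder path; replace with the path to your own TALON database (.db file)
database = "path/to/your_talon.db"

conn = sqlite3.connect(database)
cursor = conn.cursor()

# List every distinct annotation name recorded in the database
cursor.execute("SELECT DISTINCT annot_name FROM gene_annotations")
annotations = [str(x[0]) for x in cursor.fetchall()]
conn.close()

print(annotations)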

@rkyger-git
Author

OK, I ran the code. The output I get is: ['TALON']

@fairliereese
Member

OK, so I ran your talon_initialize_database command with a GTF I had lying around, and directly after running it, the annotations in the database are ['a100_rna']. This leads me to believe something went wrong with the database initialization. Can you re-run the talon_initialize_database command and, before running any reads through TALON, execute that Python snippet again?
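
For anyone hitting the same error, the check above can be wrapped in a small helper that compares the annotation names stored in the database with the name passed to -a. This is only a sketch: it reuses the query from the snippet earlier in the thread, and the function names list_annotations and check_annotation are made up for illustration, not part of TALON.

```python
import sqlite3


def list_annotations(db_path):
    """Return the distinct annotation names stored in a TALON database."""
    conn = sqlite3.connect(db_path)
    try:
        # Same query as the snippet earlier in this thread
        rows = conn.execute(
            "SELECT DISTINCT annot_name FROM gene_annotations"
        ).fetchall()
    finally:
        conn.close()
    return [str(row[0]) for row in rows]


def check_annotation(db_path, expected):
    """Report whether the expected annotation name is present (hypothetical helper)."""
    names = list_annotations(db_path)
    if expected in names:
        return f"OK: '{expected}' found in {names}"
    return f"Missing: '{expected}' not in {names}"
```

Calling print(check_annotation("A100.db", "a100_rna")) right after talon_initialize_database, and again after running talon, should show at which step the expected name goes missing.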

@rkyger-git
Author

I re-ran the talon_initialize_database command and then ran the Python snippet. The output I get this time is: [].

@fairliereese
Member

Yeah so that's definitely unexpected and likely what's causing the issue. Would you be able to try re-installing the latest commits from GitHub and rerunning the talon_initialize_database command and seeing if that changes the annotations that are in your db?

If that doesn't work you're welcome to send the GTF you're trying to initialize from to me (freese {at} uci.edu) and I will try to dig into what's going wrong in the source code.

@fairliereese
Member

Oh actually based on what I found out in this issue, I think it's possible that your GTF might be formatted in a way that's incompatible with TALON. Check out this wiki entry for more details.
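
The comment above doesn't spell out which formatting problem applies, so the following is only a rough first check, not the wiki's procedure. It counts the feature types in column 3 of a GTF and flags any that are absent. The list of required features (gene, transcript, exon) is an assumption made for this sketch, and the function names are hypothetical.

```python
from collections import Counter


def count_gtf_features(gtf_path):
    """Count entries per feature type (column 3) in a tab-separated GTF file."""
    counts = Counter()
    with open(gtf_path) as handle:
        for line in handle:
            # Skip comment/header lines and blank lines
            if line.startswith("#") or not line.strip():
                continue
            fields = line.rstrip("\n").split("\t")
            # A well-formed GTF record has 9 tab-separated columns
            if len(fields) >= 9:
                counts[fields[2]] += 1
    return counts


def missing_features(counts, required=("gene", "transcript", "exon")):
    """Return the required feature types with no entries (the list is an assumption)."""
    return [feature for feature in required if counts.get(feature, 0) == 0]
```

Running missing_features(count_gtf_features("A100.gtf")) returns an empty list when all three feature types are present, and otherwise names the ones that were not found.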

@rkyger-git
Author

Thanks, I reformatted the GTF and was able to get talon_abundance to work. However, talon_filter_transcripts produces an empty output file and no error messages.

@fairliereese
Member

Did you rerun talon_initialize_database and talon itself after formatting your GTF? Just checking.

@rkyger-git
Author

Yes
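
The thread ends without a resolution for the empty talon_filter_transcripts output. As a generic next diagnostic step, one could list every table in the database with its row count to see which tables were actually populated after the re-run. This sketch is not a TALON utility and assumes nothing about TALON's schema; it only reads SQLite's own catalog, and the function name table_row_counts is made up for illustration.

```python
import sqlite3


def table_row_counts(db_path):
    """Return {table_name: row_count} for every table in an SQLite database."""
    conn = sqlite3.connect(db_path)
    try:
        # sqlite_master is SQLite's built-in catalog of schema objects
        tables = [
            row[0]
            for row in conn.execute(
                "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
            )
        ]
        counts = {}
        for name in tables:
            # Table names cannot be bound as parameters, so quote the identifier
            query = f'SELECT COUNT(*) FROM "{name}"'
            counts[name] = conn.execute(query).fetchone()[0]
        return counts
    finally:
        conn.close()
```

Printing the result for A100.db, one table per line, makes it easy to spot tables that are unexpectedly empty and to include the counts in a follow-up comment.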
