Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

TransDecoder.Predict flag --single_best_only produces .pep file with multiple ORFs per transcript id #176

Open
peterszoevenyi opened this issue Jul 12, 2023 · 5 comments

Comments

@peterszoevenyi
Copy link

Dear Brian,

We are trying to create the best ORF prediction for each transcript in a fasta file.

We run TransDecoder.LongOrfs with the --complete_orfs_only flag.
Then run TransDecoder.Predict with the --single_best_only flag.

Despite using the --single_best_only flag the longest_orfs.pep (25902 fasta header entries) file contains many more peptide predictions than the transcript.fasta (11355 unique header entries) file used. This is because the longest_orfs.pep file reprorts multiple peptide predictions per transcript id named as "trasncriptid1.p1" and "transcriptid1.p2" etc.
Furthermore, if I grep only the headers containing the ".p1" string from the longest_orfs.pep file, I have fewer entries (8029) than in the transcript.fasta file (11355 unique header entries). I guess this means that some proportion of the transcript.fasta transcripts produced partial ORF predictions that were filtered out in the first step of the process, right?

According to my understanding (and the help of the TransDecoder.Predict code) the --single_best_only flag is expect to retain only the single best ORF per transcript id. Nevertheless, the results suggest that this is not the case.

Is this a bug or have we misinterpreted the description of the --single_best_only flag?

We are using /TransDecoder-TransDecoder-v5.7.0/

I would appreciate if you could let me know your thoughts on this issue which we have not been able to resolve.
With kind regards.
Peter

@brianjohnhaas
Copy link
Contributor

brianjohnhaas commented Jul 13, 2023 via email

@brianjohnhaas
Copy link
Contributor

brianjohnhaas commented Jul 14, 2023 via email

@brianjohnhaas
Copy link
Contributor

brianjohnhaas commented Jul 14, 2023 via email

@peterszoevenyi
Copy link
Author

peterszoevenyi commented Jul 17, 2023 via email

@brianjohnhaas
Copy link
Contributor

brianjohnhaas commented Jul 17, 2023 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants