You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Dec 11, 2022. It is now read-only.
Thank you for the useful script, but I noticed an error in my output for kaks.py. Because the script totals the numbers of missense and sense SNPs per gene, I found the counts to be redundant for genes that have alternatively spliced transcripts annotated in the gff/gtf file used for SnpEff. For example, two missense SNPs in one gene were each counted 12 times (once per isoform that included the exon in question) giving me a kA of 24 and a kS of 0. I was able to modify the script on my own to give me kA/kS for each feature (mRNA accession) as opposed to each gene, but it might also be useful to output kA/kS per gene while accounting for SNP redundancy.
The text was updated successfully, but these errors were encountered:
My solution was a quick and dirty one where I just swapped out
"geneid_index" for "feature_id_index", which gave me ka/ks values for each
unique RNA isoform instead of each gene. I would imagine one
possible solution for users to figure out ka/ks per gene would be something
like a bedtools merge type of step to modify the gff prior to running
SNPeff, but to check for redundancy the output of your script would be more
complicated. I'm not sure the best way to do that within your script though.
-Josh
I see. Thanks for your reply, Josh, worth it to know how you solved it, even if was "quick and dirty". If I come with a solution, I'll share it here.
Best,
Nabor
Sign up for freeto subscribe to this conversation on GitHub.
Already have an account?
Sign in.
Thank you for the useful script, but I noticed an error in my output for kaks.py. Because the script totals the numbers of missense and sense SNPs per gene, I found the counts to be redundant for genes that have alternatively spliced transcripts annotated in the gff/gtf file used for SnpEff. For example, two missense SNPs in one gene were each counted 12 times (once per isoform that included the exon in question) giving me a kA of 24 and a kS of 0. I was able to modify the script on my own to give me kA/kS for each feature (mRNA accession) as opposed to each gene, but it might also be useful to output kA/kS per gene while accounting for SNP redundancy.
The text was updated successfully, but these errors were encountered: