Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Count duplicate K Numbers (KEGG) for enrichKEGG? #681

Open
diefuechsin opened this issue Apr 8, 2024 · 0 comments
Open

Count duplicate K Numbers (KEGG) for enrichKEGG? #681

diefuechsin opened this issue Apr 8, 2024 · 0 comments

Comments

@diefuechsin
Copy link

Hi!

I am doing an analysis with clusterProfiler. I am wondering if it would make sense to let enrichKEGG count duplicate k numbers (KEGG numbers such as "K02760") for pathway enrichment analysis?

In my dataset, I have annotated coding sequences to K numbers and some K numbers appear in duplicate/multiple times. Would it be beneficial for the outcome of an over-representation study to also count the duplicates/multiples for GeneRatio and statistics?

I observed, for example, K02760 (assigned to pathway ko02060) was annotated to two different coding sequences. However, K02760 seems to be counted only once by enrichKEGG for the statistics and GeneRatio for this pathway.

Thanks to your comment/help in advance!
Best regards

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant