Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use newer uniref 100 database for kegg calling #78

Closed
cmkobel opened this issue Jan 22, 2024 · 5 comments
Closed

Use newer uniref 100 database for kegg calling #78

cmkobel opened this issue Jan 22, 2024 · 5 comments
Labels
enhancement New feature or request

Comments

@cmkobel
Copy link
Owner

cmkobel commented Jan 22, 2024

According to the checkm2 paper, the current version may be as old as from 2018?

@cmkobel
Copy link
Owner Author

cmkobel commented Mar 12, 2024

Idea:

One plan could be instead to use GO biological process https://genomespot.blogspot.com/2024/02/dont-use-kegg.html

So download uniref and mapping_selected (https://www.uniprot.org/help/downloads). Map with diamond or mmseqs2. Download GO BP and perform the the hierarchical mapping and compute enrichment.

@cmkobel
Copy link
Owner Author

cmkobel commented Mar 18, 2024

Ref chklovski/CheckM2#99

Looks like KEGG is not a viable option for the future, and it is not possible to continue reusing the checkm2 database. Will have to seriously consider implementing GO BP GSEA.

@cmkobel
Copy link
Owner Author

cmkobel commented Mar 23, 2024

As I just closed #90 for merging it into here, I should mention that in any case, for licensing reasons (I think) the user (pipeline instance) must manually download the file:

Downloaded from https://www.kegg.jp/kegg-bin/download_htext?htext=ko00001.keg&format=json&filedir=
Used in kegg_pathway.R.

@cmkobel
Copy link
Owner Author

cmkobel commented May 14, 2024

I still don't have a good plan for this. Is going back to kofam_scan the best option? How does eggnog do it, and can its output for downstream hypertests?

@cmkobel cmkobel added the enhancement New feature or request label May 14, 2024
@cmkobel
Copy link
Owner Author

cmkobel commented May 23, 2024

Solution was to use eggnog to map to KO. Is used in rule kegg_pathway as well now.

@cmkobel cmkobel closed this as completed May 23, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant