Small (mock) reference data #572
Comments
Hi Brent. GTDB-Tk has the command https://ecogenomics.github.io/GTDBTk/commands/check_install.html
The processed genomes are not the problem. The problem is the reference data (i.e. GTDB). We are wondering whether there is a small database (or whether one can be constructed) that we can use in tests. Our problem is that we have hundreds of tools in one of the main Galaxy tool repos (https://github.com/galaxyproject/tools-iuc/) and have to restrict ourselves to small test (and reference) data.
Hi Brent, Makes sense, but unfortunately we don't have such a set of reference data. What might work for you is to run a single genome that belongs to a species in the GTDB reference database. This will result in only the ANI prescreen part of GTDB-Tk running, and thus avoids the memory and time required by the tree placement (pplacer) step. This obviously isn't a full test of GTDB-Tk, but it at least demonstrates that it still runs. Cheers,
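For anyone wiring this suggestion into a CI job, here is a minimal sketch of how the single-genome run could be assembled. It only builds and prints the command, it does not run GTDB-Tk. The flag names are the ones I understand the GTDB-Tk 2.x `classify_wf` command line to accept, so please check them against `gtdbtk classify_wf --help` for your version. The helper name `build_classify_cmd` and all paths are hypothetical. Note that the full reference data still has to be installed (and pointed to via `GTDBTK_DATA_PATH`), so this shortens the run but does not shrink the download.

```python
import shlex
from pathlib import Path


def build_classify_cmd(genome_dir, out_dir, mash_db, extension="fna", cpus=1):
    """Assemble a `gtdbtk classify_wf` call for a single-genome smoke test.

    Assumption: the one genome in `genome_dir` belongs to a species already
    in the GTDB reference database, so the ANI prescreen should classify it
    and the pplacer step should not be needed.
    """
    return [
        "gtdbtk", "classify_wf",
        "--genome_dir", str(genome_dir),  # directory holding the one test genome
        "--out_dir", str(out_dir),        # where GTDB-Tk writes its results
        "--extension", extension,         # file extension of the genome FASTA
        "--mash_db", str(mash_db),        # Mash sketch used by the ANI prescreen
        "--cpus", str(cpus),
    ]


# Hypothetical paths, for illustration only.
cmd = build_classify_cmd(
    Path("test-data/genomes"),
    Path("gtdbtk_out"),
    Path("gtdbtk_out/mash_db.msh"),
)
print(shlex.join(cmd))
```

The resulting list can be passed to `subprocess.run(cmd, check=True)` on a machine where GTDB-Tk and its reference data are available.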
For the Galaxy tool wrapping the classify workflow, it would be great to have a small reference data set so that a test can run in CI.
Is there a possibility to do this?