[Feature Request] Conda Distribution for Taxonomic Identification #117

Open
erfanshekarriz opened this issue Nov 12, 2023 · 1 comment

Comments

@erfanshekarriz

Hi and hope all is well!

I wanted to ask about the timeline for a conda installation. I understand that PPR-Meta is not conda-installable, which is why the conda implementation is on hold. In the meantime, could the taxonomic identification module be released as a separate conda package?

I think the first step, viral contig identification, is largely a matter of personal preference: there is a wide array of tools out there and more in development (e.g. geNomad, Phamer/PhaBox, etc.). Your taxonomic identification protocol, however, is quite unique and useful, and tools for viral taxonomic annotation are rare!

Since the taxonomic identification doesn't rely on PPR-Meta, I was wondering whether a conda recipe could be generated for it separately. I'm currently building a custom viral calling pipeline for our lab, and I have been struggling to find good taxonomic identification protocols that are conda-installable and easy to integrate into Snakemake.

Thanks for the great tool!

Best,

Erfan

@hoelzer
Collaborator

hoelzer commented Dec 11, 2023

Hey @erfanshekarriz

I think something like that is already possible, but I have not tested it for a while. You can use the --only annotate parameter (https://github.com/EBI-Metagenomics/emg-viral-pipeline/blob/master/virify.nf#L755C7-L755C19) to skip virus prediction and run only the annotation module on all contigs in your input FASTA (only a length filter applies).

That way, I think you can even use -profile conda.
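
Since the question is about calling this from Snakemake, here is a minimal Python sketch of how the two options mentioned above might be assembled into a Nextflow command. Only `--only annotate` and `-profile conda` come from this thread. The `--fasta` and `--output` parameter names, the pipeline reference passed to `nextflow run`, and the helper name `build_virify_annotate_cmd` are assumptions for illustration, and should be checked against the pipeline's own help output before use.

```python
import shlex


def build_virify_annotate_cmd(fasta, outdir, profile="conda"):
    """Build an argument list for an annotation-only pipeline run.

    Hypothetical helper: it only assembles the command and does not run it.
    """
    return [
        "nextflow", "run", "EBI-Metagenomics/emg-viral-pipeline",
        "--fasta", str(fasta),    # assumed name of the input parameter
        "--output", str(outdir),  # assumed name of the output parameter
        "--only", "annotate",     # skip virus prediction (from this thread)
        "-profile", profile,      # conda profile (from this thread)
    ]


if __name__ == "__main__":
    cmd = build_virify_annotate_cmd("contigs.fasta", "virify_out")
    # Print a shell-safe version of the command for inspection.
    print(shlex.join(cmd))
    # To execute it instead: subprocess.run(cmd, check=True)
```

Returning a list rather than a single string keeps file paths with spaces intact when the list is passed to `subprocess.run`, and the printed form can be pasted into a Snakemake `shell:` directive.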

We might have to bump some tool versions in the conda env files to match the current versions in the Docker containers, but that would be possible.

Does that help?

I think generating a conda package to cover the whole taxonomic classification part of VIRify is beyond the scope of the pipeline and its implementation. But I also get your point about separating virus contig prediction and taxonomy annotation!
