Cut ASVs for taxonomy assignment #225

Open
erikrikarddaniel opened this issue Feb 23, 2021 · 5 comments
Labels
enhancement (New feature or request), not urgent

Comments

@erikrikarddaniel
Member

In some cases, a user might have sequenced an amplicon that is longer than the sequences in the database they want to use. To handle this, ASV sequences could be cut down to the region covered by the database before taxonomy assignment.

@erikrikarddaniel added the "enhancement" and "not urgent" labels Feb 23, 2021

@d4straub
Collaborator

Interesting. Do you have an example of when that would be the case, to illustrate the problem?
What parameters would you use to cut the ASV sequences? A degenerate nucleotide sequence? Nucleotide positions?

@erikrikarddaniel
Member Author

People are apparently sequencing whole rRNA operons, but most databases are limited to a single gene, or ITS, per sequence. To assign taxonomy, one would hence have to cut the ASV down to what's in a particular database. The alternative would be to trust that the k-mer distribution of the full-length ASV is the same as that of the region in the database, but I don't think that would work well.
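
To make the two cutting options raised above concrete (nucleotide positions versus a degenerate nucleotide sequence), here is a minimal Python sketch. It is not part of the pipeline: the function names `cut_by_positions` and `cut_between` are hypothetical, and it assumes 1-based inclusive positions and flanking sequences written with IUPAC ambiguity codes.

```python
import re

# IUPAC nucleotide codes mapped to regular-expression character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}


def iupac_to_regex(pattern):
    """Translate a degenerate nucleotide sequence into a regex string."""
    return "".join(IUPAC[base] for base in pattern.upper())


def cut_by_positions(seq, start, end):
    """Return seq[start..end], with 1-based inclusive positions (an assumption)."""
    return seq[start - 1:end]


def cut_between(seq, left, right):
    """Return the part of seq between the first match of `left` and the
    next match of `right`, excluding both flanks.
    Returns None if either flank is not found."""
    seq = seq.upper()
    left_match = re.search(iupac_to_regex(left), seq)
    if left_match is None:
        return None
    rest = seq[left_match.end():]
    right_match = re.search(iupac_to_regex(right), rest)
    if right_match is None:
        return None
    return rest[:right_match.start()]
```

Returning `None` when a flank is missing leaves it to the caller to decide whether such ASVs are dropped or assigned with their full length.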

@jtangrot
Contributor

jtangrot commented May 3, 2021

We have sequenced more or less the whole rRNA operon in fungi, but as (most of) UNITE only contains the ITS region, we need to cut the resulting ASVs and use only the ITS (or even ITS2) region for the taxonomy assignment. For this we use ITSx (https://microbiology.se/software/itsx/), which can be used both for fungi and other phyla. Would it be an option to include this as an optional step, e.g. with a parameter --cut_its?
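
As a rough illustration of how such an optional step could fit in, the Python sketch below cuts each ASV to a given region while keeping the ASV IDs, so that the taxonomy assigned to the cut sequences can be joined back to the full-length ASVs. Everything here is an assumption for illustration: the `regions` table (ASV ID to 1-based inclusive start and end) is hypothetical and would in practice be derived from the output of a tool such as ITSx, whose output format is not reproduced here.

```python
def parse_fasta(text):
    """Parse FASTA text into a dict of sequence ID -> sequence.
    The ID is the first whitespace-delimited token of the header."""
    records = {}
    current = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            current = line[1:].split()[0]
            records[current] = ""
        elif current is not None:
            records[current] += line
    return records


def cut_asvs(asvs, regions):
    """Cut each ASV to its region (1-based inclusive coordinates).
    ASVs without a region are not cut; their IDs are returned separately
    so the caller can decide how to handle them."""
    cut, missing = {}, []
    for asv_id, seq in asvs.items():
        if asv_id in regions:
            start, end = regions[asv_id]
            cut[asv_id] = seq[start - 1:end]
        else:
            missing.append(asv_id)
    return cut, missing


def to_fasta(records):
    """Serialise a dict of ID -> sequence back to FASTA text."""
    return "".join(f">{asv_id}\n{seq}\n" for asv_id, seq in records.items())
```

Keeping the original IDs on the cut sequences is the main design point: the abundance table stays tied to the full-length ASVs, and only the taxonomy assignment sees the shortened sequences.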

@erikrikarddaniel
Member Author

I suppose we were thinking of something general, and this sounds specific to ITS. OTOH, it is better to have something that works for the only use case I'm aware of than to have nothing, so, in my opinion, go ahead and add it.

@d4straub
Collaborator

So is this solved?


3 participants