Skip to content

A tool to extract mappable reads from various library preparation protocols.

License

Notifications You must be signed in to change notification settings

TimoLassmann/tagdust

Repository files navigation

Introduction

Raw sequences produced by next generation sequencing (NGS) machines contain adapter, linker, barcode and fingerprint (UMI) sequences. TagDust2 is designed to make is as easy as possible to de-multiplex reads from any library preparation method. In addition, TagDust2 can detect which library prep. was used from the raw reads themselves making it possible to automate processing pipelines.

Image of example output

TagDust allows users to specify the expected architecture of a read and converts it into a hidden Markov model. The latter can assign sequences to a particular barcode (or index) even in the presence of sequencing errors. Sequences not matching the architecture (primer dimers, contaminants etc.) are automatically discarded.

Installation:

Unpack the tarball:

tar -zxvf tagdust-XXX.tar.gz
cd tagdust
./autogen.sh
./configure
make
make check

At this point the TagDust executable appears in the src directory. You can copy it to any directory in your path. To install it system wide type:

make install

Manual

Have a look at the user manual in the doc directory!

Please cite:

Lassmann, Timo. "TagDust2: a generic method to extract reads from sequencing data." BMC bioinformatics 16.1 (2015): 24. doi.org/10.1186/s12859-015-0454-y

About

A tool to extract mappable reads from various library preparation protocols.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published