Centrifuge

Classifier for metagenomic sequences

[Centrifuge] is a novel microbial classification engine that enables rapid, accurate and sensitive labeling of reads and quantification of species on desktop computers. The system uses a novel indexing scheme based on the Burrows-Wheeler transform (BWT) and the Ferragina-Manzini (FM) index, optimized specifically for the metagenomic classification problem. Centrifuge requires a relatively small index (4.7 GB for all complete bacterial and viral genomes plus the human genome) and classifies sequences at very high speed, allowing it to process the millions of reads from a typical high-throughput DNA sequencing run within a few minutes. Together these advances enable timely and accurate analysis of large metagenomics data sets on conventional desktop computers

The Centrifuge hompage is http://www.ccb.jhu.edu/software/centrifuge

The Centrifuge paper is available at https://genome.cshlp.org/content/26/12/1721

The Centrifuge poster is available at http://www.ccb.jhu.edu/people/infphilo/data/Centrifuge-poster.pdf

For more details on installing and running Centrifuge, look at MANUAL

Quick guide

Installation from source

git clone https://github.com/DaehwanKimLab/centrifuge
cd centrifuge
make
sudo make install prefix=/usr/local

Building indexes

We provide several indexes on the Centrifuge homepage at http://www.ccb.jhu.edu/software/centrifuge. Centrifuge needs sequence and taxonomy files, as well as sequence ID to taxonomy ID mapping. See the MANUAL files for details. We provide a Makefile that simplifies the building of several standard and custom indices

cd indices
make p+h+v                   # bacterial, human, and viral genomes [~12G]
make p_compressed            # bacterial genomes compressed at the species level [~4.2G]
make p_compressed+h+v        # combination of the two above [~8G]

Name		Name	Last commit message	Last commit date
Latest commit History 422 Commits
centrifuge.xcodeproj		centrifuge.xcodeproj
doc		doc
evaluation		evaluation
example		example
indices		indices
third_party		third_party
.gitignore		.gitignore
AUTHORS		AUTHORS
LICENSE		LICENSE
MANUAL		MANUAL
MANUAL.markdown		MANUAL.markdown
Makefile		Makefile
NEWS		NEWS
README.md		README.md
TUTORIAL		TUTORIAL
VERSION		VERSION
aligner_bt.cpp		aligner_bt.cpp
aligner_bt.h		aligner_bt.h
aligner_cache.cpp		aligner_cache.cpp
aligner_cache.h		aligner_cache.h
aligner_metrics.h		aligner_metrics.h
aligner_result.h		aligner_result.h
aligner_seed.cpp		aligner_seed.cpp
aligner_seed.h		aligner_seed.h
aligner_seed_policy.cpp		aligner_seed_policy.cpp
aligner_seed_policy.h		aligner_seed_policy.h
aligner_sw.cpp		aligner_sw.cpp
aligner_sw.h		aligner_sw.h
aligner_sw_common.h		aligner_sw_common.h
aligner_sw_nuc.h		aligner_sw_nuc.h
aligner_swsse.cpp		aligner_swsse.cpp
aligner_swsse.h		aligner_swsse.h
aligner_swsse_ee_i16.cpp		aligner_swsse_ee_i16.cpp
aligner_swsse_ee_u8.cpp		aligner_swsse_ee_u8.cpp
aligner_swsse_loc_i16.cpp		aligner_swsse_loc_i16.cpp
aligner_swsse_loc_u8.cpp		aligner_swsse_loc_u8.cpp
aln_sink.h		aln_sink.h
alphabet.cpp		alphabet.cpp
alphabet.h		alphabet.h
assert_helpers.h		assert_helpers.h
binary_sa_search.h		binary_sa_search.h
bitpack.h		bitpack.h
blockwise_sa.h		blockwise_sa.h
bt2_idx.cpp		bt2_idx.cpp
bt2_idx.h		bt2_idx.h
bt2_io.h		bt2_io.h
bt2_util.h		bt2_util.h
btypes.h		btypes.h
ccnt_lut.cpp		ccnt_lut.cpp
centrifuge		centrifuge
centrifuge-BuildSharedSequence.pl		centrifuge-BuildSharedSequence.pl
centrifuge-RemoveEmptySequence.pl		centrifuge-RemoveEmptySequence.pl
centrifuge-RemoveN.pl		centrifuge-RemoveN.pl
centrifuge-build		centrifuge-build
centrifuge-compress.pl		centrifuge-compress.pl
centrifuge-download		centrifuge-download
centrifuge-inspect		centrifuge-inspect
centrifuge-kreport		centrifuge-kreport
centrifuge-promote		centrifuge-promote
centrifuge-sort-nt.pl		centrifuge-sort-nt.pl
centrifuge.cpp		centrifuge.cpp
centrifuge_build.cpp		centrifuge_build.cpp
centrifuge_build_main.cpp		centrifuge_build_main.cpp
centrifuge_compress.cpp		centrifuge_compress.cpp
centrifuge_inspect.cpp		centrifuge_inspect.cpp
centrifuge_main.cpp		centrifuge_main.cpp
centrifuge_report.cpp		centrifuge_report.cpp
classifier.h		classifier.h
diff_sample.cpp		diff_sample.cpp
diff_sample.h		diff_sample.h
dp_framer.cpp		dp_framer.cpp
dp_framer.h		dp_framer.h
ds.cpp		ds.cpp
ds.h		ds.h
edit.cpp		edit.cpp
edit.h		edit.h
endian_swap.h		endian_swap.h
fast_mutex.h		fast_mutex.h
filebuf.h		filebuf.h
formats.h		formats.h
functions.sh		functions.sh
group_walk.cpp		group_walk.cpp
group_walk.h		group_walk.h
hi_aligner.h		hi_aligner.h
hier_idx.h		hier_idx.h
hier_idx_common.h		hier_idx_common.h
hyperloglogbias.h		hyperloglogbias.h
hyperloglogplus.h		hyperloglogplus.h
limit.cpp		limit.cpp
limit.h		limit.h
ls.cpp		ls.cpp
ls.h		ls.h
mask.cpp		mask.cpp
mask.h		mask.h
mem_ids.h		mem_ids.h
mm.h		mm.h
multikey_qsort.h		multikey_qsort.h
opts.h		opts.h
outq.cpp		outq.cpp
outq.h		outq.h

License

DaehwanKimLab/centrifuge

Folders and files

Latest commit

History