Fast and sensitive taxonomic classification for metagenomics with Kaiju
Citations Over Time: Top 1% of 2016 papers
Abstract
Metagenomics has emerged as an important field of research, not only in microbial ecology but also in human health and disease, and metagenomic studies are performed on increasingly large scales. Although recent taxonomic classification programs achieve high speed by comparing genomic k-mers, they often lack the sensitivity to overcome evolutionary divergence, so large fractions of metagenomic reads remain unclassified. Here we present the novel metagenome classifier Kaiju, which finds maximum (in-)exact matches at the protein level using the Burrows-Wheeler transform. We show in a genome exclusion benchmark that Kaiju classifies reads with higher sensitivity and similar precision compared with current k-mer-based classifiers, especially in genera that are underrepresented in reference databases. We also demonstrate that Kaiju classifies up to 10 times more reads in real metagenomes. Kaiju can process millions of reads per minute and can run on a standard PC. Source code and web server are available at http://kaiju.binf.ku.dk.
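The idea of protein-level matching can be illustrated with a minimal sketch. It is not Kaiju's implementation: it assumes the standard genetic code, uses a naive substring search in place of the Burrows-Wheeler transform, handles exact matches only, and all names (`six_frame_fragments`, `longest_exact_match`, `classify`, `min_length`, the toy reference) are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def six_frame_fragments(read):
    """Translate a DNA read in all six frames and split at stop codons."""
    fragments = []
    reverse_complement = read.translate(COMPLEMENT)[::-1]
    for strand in (read, reverse_complement):
        for offset in range(3):
            protein = "".join(
                CODON_TABLE.get(strand[i:i + 3], "X")  # "X" for unknown codons
                for i in range(offset, len(strand) - 2, 3)
            )
            fragments.extend(f for f in protein.split("*") if f)
    return fragments


def longest_exact_match(fragment, reference):
    """Return (length, taxon) of the longest substring of `fragment`
    that occurs in any reference protein (naive search, no index)."""
    best = (0, None)
    for start in range(len(fragment)):
        # Only try substrings strictly longer than the current best.
        for end in range(len(fragment), start + best[0], -1):
            piece = fragment[start:end]
            hit = next((t for t, seq in reference.items() if piece in seq), None)
            if hit is not None:
                best = (end - start, hit)
                break
    return best


def classify(read, reference, min_length=4):
    """Assign the taxon of the longest exact protein match, or None.
    `min_length` is an illustrative threshold, not a Kaiju default."""
    matches = (longest_exact_match(f, reference) for f in six_frame_fragments(read))
    length, taxon = max(matches, key=lambda m: m[0], default=(0, None))
    return taxon if length >= min_length else None


# Toy reference: taxon label -> protein sequence (made-up data).
reference = {"taxon_A": "MKTAYIAKQR", "taxon_B": "MSLLPWWGHE"}
print(classify("ATGAAAACCGCGTATATTGCGAAA", reference))
```

Because synonymous codon changes leave the amino acid sequence unchanged, a match of this kind can survive nucleotide divergence that would break a genomic k-mer match. A real classifier would replace the nested loops with an indexed search over the reference proteins.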