Faster SEQUEST Searching for Peptide Identification from Tandem Mass Spectra
Citations Over TimeTop 10% of 2011 papers
Abstract
Computational analysis of mass spectra remains the bottleneck in many proteomics experiments. SEQUEST was one of the earliest software packages to identify peptides from mass spectra by searching a database of known peptides. Though still popular, SEQUEST performs slowly. Crux and TurboSEQUEST have successfully sped up SEQUEST by adding a precomputed index to the search, but the demand for ever-faster peptide identification software continues to grow. Tide, introduced here, is a software program that implements the SEQUEST algorithm for peptide identification and that achieves a dramatic speedup over Crux and SEQUEST. The optimization strategies detailed here employ a combination of algorithmic and software engineering techniques to achieve speeds up to 170 times faster than a recent version of SEQUEST that uses indexing. For example, on a single Xeon CPU, Tide searches 10,000 spectra against a tryptic database of 27,499 Caenorhabditis elegans proteins at a rate of 1550 spectra per second, which compares favorably with a rate of 8.8 spectra per second for a recent version of SEQUEST with index running on the same hardware.
Related Papers
- → JUMP: A Tag-based Database Search Tool for Peptide Identification with High Sensitivity and Accuracy(2014)190 cited
- → Comparison of Mascot and X!Tandem Performance for Low and High Accuracy Mass Spectrometry and the Development of an Adjusted Mascot Threshold(2008)70 cited
- → Optimized Web Searching Using Inverted Indexing Technique(2022)10 cited
- → Effects of spam removal on search engine efficiency and effectiveness(2012)8 cited
- → 3D Systems acquires Belgian printer(2014)