Prediction of Missed Cleavage Sites in Tryptic Peptides Aids Protein Identification in Proteomics
Citations Over TimeTop 14% of 2006 papers
Abstract
Protein identification via peptide mass fingerprinting (PMF) remains a key component of high-throughput proteomics experiments in post-genomic science. Candidate protein identifications are made using bioinformatic tools from peptide peak lists obtained via mass spectrometry (MS). These algorithms rely on several search parameters, including the number of potential uncut peptide bonds matching the primary specificity of the hydrolytic enzyme used in the experiment. Typically, up to one of these "missed cleavages" are considered by the bioinformatics search tools, usually after digestion of the in silico proteome by trypsin. Using two distinct, nonredundant datasets of peptides identified via PMF and tandem MS, a simple predictive method based on information theory is presented which is able to identify experimentally defined missed cleavages with up to 90% accuracy from amino acid sequence alone. Using this simple protocol, we are able to "mask" candidate protein databases so that confident missed cleavage sites need not be considered for in silico digestion. We show that that this leads to an improvement in database searching, with two different search engines, using the PMF dataset as a test set. In addition, the improved approach is also demonstrated on an independent PMF data set of known proteins that also has corresponding high-quality tandem MS data, validating the protein identifications. This approach has wider applicability for proteomics database searching, and the program for predicting missed cleavages and masking Fasta-formatted protein sequence databases has been made available via http:// ispider.smith.man.ac uk/MissedCleave.
Related Papers
- → Applications of Tandem Mass Spectrometry (MS/MS) in Protein Analysis for Biomedical Research(2022)129 cited
- → Phospho-proteomics: Evaluation of the use of enzymatic de-phosphorylation and differential mass spectrometric peptide mass mapping for site specific phosphorylation assignment in proteins separated by gel electrophoresis(2001)154 cited
- → A single protein to multiple peptides: Investigation of protein-peptide correlations using targeted alpha-2-macroglobulin analysis(2023)2 cited
- → Integration of Chromatography and Peptide Mass Modification for Quantitative Proteomics(2003)2 cited
- → A single protein to multiple peptides: Investigation of protein-peptide relationship using targeted alpha-2-macroglobulin analysis(2022)