A Model for Random Sampling and Estimation of Relative Protein Abundance in Shotgun Proteomics
Citations Over Time: Top 1% of 2004 papers
Abstract
Proteomic analysis of complex protein mixtures using proteolytic digestion and liquid chromatography in combination with tandem mass spectrometry is a standard approach in biological studies. Data-dependent acquisition is used to automatically acquire tandem mass spectra of peptides eluting into the mass spectrometer. In more complex mixtures, such as whole-cell lysates, data-dependent acquisition samples only a subset of the peptide ions present rather than acquiring tandem mass spectra for all available ions. We analyzed the sampling process and developed a statistical model to accurately predict the level of sampling expected for mixtures of a specific complexity. The model also predicts how many analyses are required for saturated sampling of a complex protein mixture. Based on our model, 10 analyses are required to reach a 95% saturation level of protein identifications for a soluble yeast cell lysate. The statistical model also suggests a relationship between the level of sampling observed for a protein and the relative abundance of the protein in the mixture. We demonstrate a linear dynamic range over 2 orders of magnitude by using the number of spectra (spectral sampling) acquired for each protein.
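The sampling idea in the abstract can be illustrated with a small simulation. The sketch below is not the statistical model from the paper, whose details are not given here. It assumes, for illustration only, that each acquired spectrum comes from a protein chosen at random with probability proportional to its abundance, and that "saturation" means the number of proteins seen after the last simulated run. The function names, the abundance distribution, and the run sizes are all hypothetical.

```python
import random


def simulate_runs(abundances, spectra_per_run, n_runs, seed=0):
    """Simulate repeated analyses of one mixture.

    Assumption: every spectrum is drawn independently, with probability
    proportional to protein abundance. Returns the cumulative number of
    distinct proteins identified after each run and the total spectral
    count per protein.
    """
    rng = random.Random(seed)
    proteins = list(range(len(abundances)))
    seen = set()
    spectral_counts = [0] * len(abundances)
    cumulative = []
    for _ in range(n_runs):
        picks = rng.choices(proteins, weights=abundances, k=spectra_per_run)
        for p in picks:
            spectral_counts[p] += 1
        seen.update(picks)
        cumulative.append(len(seen))
    return cumulative, spectral_counts


def runs_to_fraction(cumulative, fraction=0.95):
    """First run (1-based) at which the cumulative identifications reach
    the given fraction of the final value (an assumed saturation proxy)."""
    target = fraction * cumulative[-1]
    for run_index, n_identified in enumerate(cumulative, start=1):
        if n_identified >= target:
            return run_index
    return len(cumulative)


if __name__ == "__main__":
    # Hypothetical mixture: 500 proteins spanning 3 orders of magnitude.
    abundances = [10 ** (-3 * i / 499) for i in range(500)]
    cumulative, counts = simulate_runs(abundances, spectra_per_run=2000, n_runs=10)
    print("cumulative identifications per run:", cumulative)
    print("runs to 95% of final:", runs_to_fraction(cumulative))
    print("spectral count, most vs least abundant:", counts[0], counts[-1])
```

Under these assumptions, abundant proteins accumulate many spectra while rare ones are seen only after several runs, which is the intuition behind using spectral counts as a rough measure of relative abundance.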