Proteome Analyst: custom predictions with explanations in a web-based tool for high-throughput proteome annotations
Citations Over TimeTop 11% of 2004 papers
Abstract
Proteome Analyst (PA) (http://www.cs.ualberta.ca/~bioinfo/PA/) is a publicly available, high-throughput, web-based system for predicting various properties of each protein in an entire proteome. Using machine-learned classifiers, PA can predict, for example, the GeneQuiz general function and Gene Ontology (GO) molecular function of a protein. In addition, PA is currently the most accurate and most comprehensive system for predicting subcellular localization, the location within a cell where a protein performs its main function. Two other capabilities of PA are notable. First, PA can create a custom classifier to predict a new property, without requiring any programming, based on labeled training data (i.e. a set of examples, each with the correct classification label) provided by a user. PA has been used to create custom classifiers for potassium-ion channel proteins and other general function ontologies. Second, PA provides a sophisticated explanation feature that shows why one prediction is chosen over another. The PA system produces a Naïve Bayes classifier, which is amenable to a graphical and interactive approach to explanations for its predictions; transparent predictions increase the user's confidence in, and understanding of, PA.
Related Papers
- → Accelerating the search for the missing proteins in the human proteome(2017)98 cited
- → The potential clinical impact of the tissue-based map of the human proteome(2015)68 cited
- → A Proteome-wide Domain-centric Perspective on Protein Phosphorylation(2014)6 cited
- → The Use of Protein—protein Interaction Networks for Genome Wide Protein Function Comparisons and Predictions(2004)5 cited
- Towards subcellular localization of the human proteome using bioimaging(2012)