Correlated mutation analyses on super‐family alignments reveal functionally important residues
Citations Over TimeTop 10% of 2009 papers
Abstract
Correlated mutation analyses (CMA) on multiple sequence alignments are widely used for the prediction of the function of amino acids. The accuracy of CMA-based predictions is mainly determined by the number of sequences, by their evolutionary distances, and by the quality of the alignments. These criteria are best met in structure-based sequence alignments of large super-families. So far, CMA-techniques have mainly been employed to study the receptor interactions. The present work shows how a novel CMA tool, called Comulator, can be used to determine networks of functionally related residues in enzymes. These analyses provide leads for protein engineering studies that are directed towards modification of enzyme specificity or activity. As proof of concept, Comulator has been applied to four enzyme super-families: the isocitrate lyase/phoshoenol-pyruvate mutase super-family, the hexokinase super-family, the RmlC-like cupin super-family, and the FAD-linked oxidases super-family. In each of those cases networks of functionally related residue positions were discovered that upon mutation influenced enzyme specificity and/or activity as predicted. We conclude that CMA is a powerful tool for redesigning enzyme activity and selectivity.
Related Papers
- → PALI--a database of Phylogeny and ALIgnment of homologous protein structures(2001)103 cited
- → Sequence analysis corresponding to the PPE and PE proteins inMycobacterium tuberculosis and other genomes(2003)58 cited
- → SUPFAM--a database of potential protein superfamily relationships derived by comparing sequence-based and structure-based families: implications for structural genomics and function annotation in genomes(2002)51 cited
- → Integration of related sequences with protein three-dimensional structural families in an updated version of PALI database(2003)46 cited
- → HomologyPlot: Searching for homology to a family of proteins using a database of unique conserved patterns(1994)1 cited