Probabilistic description of protein alignments for sequences and structures
Abstract
A number of equally optimal alignments inherently exist in the sequence and structure comparisons among proteins. To represent the sub-optimal alignments systematically, we have developed a method of generating probabilistic alignments for sequences and structures, by which the correspondence between pairs of residues is evaluated in a probabilistic manner. Our method uses the periodic boundary condition to avoid the entropy artifact favoring full-length matches. In the structure comparison, the environmental effects are incorporated by the mean-field approximation. We applied this method in comparisons of two pairs of proteins with internal symmetry; the first set were proteins of TIM-barrel fold and the second were beta-trefoil fold. These pairs are expected to have distinct sub-optimal alignments suitable for probabilistic description with the periodic boundary. It was shown that the sequence and structure alignments are consistent with each other and that the alignments with the highest probability represent circular permutation.
Related Papers
- → Alignment of protein sequences by their profiles(2004)191 cited
- → Homology-extended sequence alignment(2005)119 cited
- → Optimization of multiple‐sequence alignment based on multiple‐structure alignment(2005)52 cited
- → ALIGN_MTX—An optimal pairwise textual sequence alignment program, adapted for using in sequence-structure alignment(2009)10 cited
- → Multiple Sequence Alignment(2003)1 cited