DALI shines a light on remote homologs: One hundred discoveries
Citations Over TimeTop 1% of 2022 papers
Abstract
Structural comparison reveals remote homology that often fails to be detected by sequence comparison. The DALI web server (http://ekhidna2.biocenter.helsinki.fi/dali) is a platform for structural analysis that provides database searches and interactive visualization, including structural alignments annotated with secondary structure, protein families and sequence logos, and 3D structure superimposition supported by color-coded sequence and structure conservation. Here, we are using DALI to mine the AlphaFold Database version 1, which increased the structural coverage of protein families by 20%. We found 100 remote homologous relationships hitherto unreported in the current reference database for protein domains, Pfam 35.0. In particular, we linked 35 domains of unknown function (DUFs) to the previously characterized families, generating a functional hypothesis that can be explored downstream in structural biology studies. Other findings include gene fusions, tandem duplications, and adjustments to domain boundaries. The evidence for homology can be browsed interactively through live examples on DALI's website.
Related Papers
- → Modeller: Generation and Refinement of Homology-Based Protein Structure Models(2003)1,735 cited
- → PROMALS3D web server for accurate multiple protein sequence and structure alignments(2008)186 cited
- → PSSRDBModel- Protein 3D structure prediction server based on the secondary structure informations(2019)8 cited
- → GenDiS database update with improved approach and features to recognize homologous sequences of protein domain superfamilies(2019)4 cited
- → Modeling three-dimensional protein structures for amino acid sequences of the CASP3 experiment using sequence-derived predictions(1999)14 cited