Fuzzy Measures on the Gene Ontology for Gene Product Similarity
Citations: top 10% of 2006 papers
Abstract
One of the most important objects in bioinformatics is a gene product (protein or RNA). For many gene products, functional information is summarized in a set of Gene Ontology (GO) annotations. For these gene products, it is reasonable to use similarity measures based on the terms found in the GO or another taxonomy. In this paper, we introduce several novel measures for computing the similarity of two gene products annotated with GO terms. The fuzzy measure similarity (FMS) has the advantage that it takes into account the context of both complete sets of annotation terms when computing the similarity between two gene products. For the case where the two gene products share no annotation terms, we propose a method that avoids a zero similarity result. To account for variations in annotation reliability, we propose a similarity measure based on the Choquet integral. These similarity measures provide additional tools for the biologist in search of functional information for gene products. Initial testing on a group of 194 sequences representing three protein families shows that the FMS and Choquet similarities correlate more strongly with BLAST sequence similarities than traditional similarity measures such as pairwise average or pairwise maximum do.
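The abstract names the Choquet integral but does not define it, so the following is a minimal sketch of the standard discrete Choquet integral only, not the paper's similarity measure. The function names, the representation of the fuzzy measure as a callable on frozensets, and the interpretation of the inputs (for example, per-term scores weighted by annotation reliability) are all assumptions made for illustration.

```python
from typing import Callable, Dict, FrozenSet, Hashable


def choquet_integral(
    values: Dict[Hashable, float],
    measure: Callable[[FrozenSet[Hashable]], float],
) -> float:
    """Discrete Choquet integral of non-negative values w.r.t. a fuzzy measure.

    `values` maps each source (hypothetically, an annotation term) to a score.
    `measure` maps a set of sources to a weight in [0, 1]; it is assumed to be
    monotone, with measure(empty set) = 0 and measure(all sources) = 1.
    """
    # Visit sources in order of increasing score.
    ordered = sorted(values, key=values.get)
    total = 0.0
    previous = 0.0
    for i, source in enumerate(ordered):
        # Set of sources whose score is at least the current one.
        upper_set = frozenset(ordered[i:])
        total += (values[source] - previous) * measure(upper_set)
        previous = values[source]
    return total


def additive_measure(weights: Dict[Hashable, float]):
    """Build an additive measure from per-source weights (assumed to sum to 1)."""
    def g(subset: FrozenSet[Hashable]) -> float:
        return sum(weights[s] for s in subset)
    return g


def max_measure(subset: FrozenSet[Hashable]) -> float:
    """Measure that gives full weight to any non-empty set."""
    return 1.0 if subset else 0.0


# Hypothetical scores and reliability weights, for illustration only.
scores = {"term_a": 0.2, "term_b": 0.5, "term_c": 0.9}
weights = {"term_a": 0.5, "term_b": 0.3, "term_c": 0.2}

weighted_mean_like = choquet_integral(scores, additive_measure(weights))
maximum_like = choquet_integral(scores, max_measure)
```

With an additive measure the integral reduces to a weighted mean of the scores, and with the measure that assigns 1 to every non-empty set it reduces to their maximum. A non-additive measure sits between such extremes, which is one way aggregation can reflect how much weight different subsets of sources deserve.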
Related Papers
- Correlation between Gene Expression and GO Semantic Similarity (2005), 247 cited
- A Cosine Similarity Measure Based on the Choquet Integral for Intuitionistic Fuzzy Sets and Its Applications to Pattern Recognition (2021), 29 cited
- A novel insight into Gene Ontology semantic similarity (2013), 61 cited
- A New Similarity Measure between Intuitionistic Fuzzy Sets Based on a Choquet Integral Model (2008), 11 cited
- The Choquet Integral (2022)