Identifying Biologically Active Compound Classes Using Phenotypic Screening Data and Sampling Statistics
Citations Over TimeTop 10% of 2005 papers
Abstract
Scoring the activity of compounds in phenotypic high-throughput assays presents a unique challenge because of the limited resolution and inherent measurement error of these assays. Techniques that leverage the structural similarity of compounds within an assay can be used to improve the hit-recovery rate from screening data. A technique is presented that uses clustering and sampling statistics to predict likely compound activity by scoring entire structural classes. A set of phenotypic assays performed against a commercially available compound library was used as a test set. Using the class-scoring technique, the resultant activity prediction scores were more reproducible than individual assay measurements, and class scoring recovered known active compounds more efficiently than individual assay measurements because class scoring had fewer false positives. Known biologically active compounds were recovered 87% of the time using class scores, suggesting a low false-negative rate that compared well to individual assay measurements. In addition, many weak and potentially novel classes of active compounds, overlooked by individual assay measurements, were suggested.
Related Papers
- → False discovery rate, sensitivity and sample size for microarray studies(2005)467 cited
- → Controlling the Proportion of False Positives in Multiple Dependent Tests(2004)169 cited
- → On the Operational Characteristics of the Benjamini and Hochberg False Discovery Rate Procedure(2007)132 cited
- → Towards dependable steganalysis(2015)18 cited
- → Simultaneous control of false positives and false negatives in multiple hypotheses testing(2007)9 cited