The Reduced Graph Descriptor in Virtual Screening and Data-Driven Clustering of High-Throughput Screening Data
Citations Over TimeTop 11% of 2004 papers
Abstract
Virtual screening and high-throughput screening are two major components of lead discovery within the pharmaceutical industry. In this paper we describe improvements to previously published methods for similarity searching with reduced graphs, with a particular focus on ligand-based virtual screening, and describe a novel use of reduced graphs in the clustering of high-throughput screening data. Literature methods for reduced graph similarity searching encode the reduced graphs as binary fingerprints, which has a number of issues. In this paper we extend the definition of the reduced graph to include positively and negatively ionizable groups and introduce a new method for measuring the similarity of reduced graphs based on a weighted edit distance. Moving beyond simple similarity searching, we show how more flexible queries can be built using reduced graphs and describe a database system that allows iterative querying with multiple representations. Reduced graphs capture many important features of ligand-receptor interactions and, in conjunction with other whole molecule descriptors, provide an informative way to review HTS data. We describe a novel use of reduced graphs in this context, introducing a method we have termed data-driven clustering, that identifies clusters of molecules represented by a particular whole molecule descriptor and enriched in active compounds.
Related Papers
- → High-throughput and ultra-high-throughput screening: solution- and cell-based approaches(2000)457 cited
- → Compound Transfer by Acoustic Droplet Ejection Promotes Quality and Efficiency in Ultra-High-Throughput Screening Campaigns(2015)30 cited
- → Clustering - What Both Theoreticians and Practitioners Are Doing Wrong(2018)1 cited
- → Study of the throughput of WLAN according to channeling and density of AP(2012)1 cited
- → Evaluation of AHS throughput using SmartCap(1997)6 cited