Recovering the true targets of specific ligands by virtual screening of the protein data bank
Citations Over TimeTop 10% of 2004 papers
Abstract
The Protein Data Bank (PDB) has been processed to extract a screening protein library (sc-PDB) of 2148 entries. A knowledge-based detection algorithm has been applied to 18,000 PDB files to find regular expressions corresponding to either protein, ions, co-factors, solvent, or ligand atoms. The sc-PDB database comprises high-resolution X-ray structures of proteins for which (i) a well-defined active site exists, (ii) the bound-ligand is a small molecular weight molecule. The database has been screened by an inverse docking tool derived from the GOLD program to recover the known target of four unrelated ligands. Both the database and the inverse screening procedures are accurate enough to rank the true target of the four investigated ligands among the top 1% scorers, with 70-100 fold enrichment with respect to random screening. Applying the proposed screening procedure to a small-sized generic ligand was much less accurate suggesting that inverse screening shall be reserved to rather selective compounds.
Related Papers
- → ProPairs: A Data Set for Protein–Protein Docking(2015)16 cited
- → Protein Secondary Structure Prediction Using Super-chains in PDB(2016)2 cited
- → INSILICO DRUG DESIGN AND MOLECULAR DOCKING STUDIES OF NOVEL COUMARIN DERIVATIVES AS ANTI-CANCER AGENTS(2017)4 cited
- → Beyond history and “on a roll”: The list of the most well‐studied human protein structures and overall trends in the protein data bank(2021)