Application of Machine Learning To Improve the Results of High-Throughput Docking Against the HIV-1 Protease
Citations Over TimeTop 10% of 2004 papers
Abstract
We have previously reported that the application of a Laplacian-modified naive Bayesian (NB) classifier may be used to improve the ranking of known inhibitors from a random database of compounds after High-Throughput Docking (HTD). The method relies upon the frequency of substructural features among the active and inactive compounds from 2D fingerprint information of the compounds. Here we present an investigation of the role of extended connectivity fingerprints in training the NB classifier against HTD studies on the HIV-1 protease using three docking programs: Glide, FlexX, and GOLD. The results show that the performance of the NB classifier is due to the presence of a large number of features common to the set of known active compounds rather than a single structural or substructural scaffold. We demonstrate that the Laplacian-modified naive Bayesian classifier trained with data from high-throughput docking is superior at identifying active compounds from a target database in comparison to conventional two-dimensional substructure search methods alone.
Related Papers
- → Performance of SMOTE in a random forest and naive Bayes classifier for imbalanced Hepatitis-B vaccination status(2021)9 cited
- → Prediction of Phishing Sites in Network using Naive Bayes compared over Random Forest with improved Accuracy(2023)6 cited
- → Analysis And Comparison Of Prediction Of Heart Disease Using Novel Random Forest And Naive Bayes Algorithm(2023)1 cited
- KLASIFIKASI DIABETES SUKU INDIAN PIMA MENGGUNAKANKOMBINASI METODE RANDOM FOREST DAN NAIVE BAYES(2020)
- → Performance Analysis of Heart Disease Prediction System using Novel Random Forest Over Naive Bayes Algorithm with an Improved Accuracy Rate(2023)