Risk prediction of type II diabetes based on random forest model
Citations Over TimeTop 10% of 2017 papers
Abstract
In recent years, type II diabetes has become a serious disease that threaten the health and mind of human. Efficient predictive modeling is required for medical researchers and practitioners. This study proposes a type II diabetes prediction model based on random forest which aims at analyzing some readily available indicators (age, weight, waist, hip, etc.) effects on diabetes and discovering some rules on given data. The method can significantly reduce the risk of disease through digging out a clear and understandable model for type II diabetes from a medical database. Random forest algorithm uses multiple decision trees to train the samples, and integrates weight of each tree to get the final results. The validation results at school of medicine, University of Virginia shows that the random forest algorithm can greatly reduce the problem of over-fitting of the single decision tree, and it can effectively predict the impact of these readily available indicators on the risk of diabetes. Additionally, we get a better prediction accuracy using random forest than using the naive Bayes algorithm, ID3 algorithm and AdaBoost algorithm.
Related Papers
- → Performance of SMOTE in a random forest and naive Bayes classifier for imbalanced Hepatitis-B vaccination status(2021)9 cited
- → Prediction of Phishing Sites in Network using Naive Bayes compared over Random Forest with improved Accuracy(2023)6 cited
- KLASIFIKASI DIABETES SUKU INDIAN PIMA MENGGUNAKANKOMBINASI METODE RANDOM FOREST DAN NAIVE BAYES(2020)
- → Performance Analysis of Heart Disease Prediction System using Novel Random Forest Over Naive Bayes Algorithm with an Improved Accuracy Rate(2023)
- → Sentiment Analysis of RUU PDP with Naive Bayes, Support Vector Machine, and Random Forest Classification Algorithm(2022)