Author/Authors :
Karimi Darabi, Parisa IT Group - Industrial Engineering K. N. Toosi University of Technology, Tehran , Tarokh, Mohammad Jafar IT Group - Faculty of Industrial Engineering K. N. Toosi University of Technology, Tehran
Abstract :
Background and objective: Currently, diabetes is one of the leading causes of
death in the world. According to several factors diagnosis of this disease is
complex and prone to human error. This study aimed to analyze the risk of having diabetes based on laboratory information, life style and, family history with the
help of machine learning algorithms. When the model is trained properly, people
can examine their risk of having diabetes.
Methods: To classify patients, by using Python, eight different machine learning
algorithms (Logistic Regression, Nearest Neighbor, Decision Tree, Random
Forest, Support Vector Machine, Naive Bayesian, Neural Network and Gradient
Boosting) were analysed. were evaluated by accuracy, sensitivity, specificity and
ROC curve parameters.
Results: The model based on the gradient boosting algorithm showed the best
performance with a prediction accuracy of %95.50.
Conclusion: In the future, this model can be used for diagnosis diabete. The basis of this study is to do more research and develop models such as other learning machine algorithms.
Keywords :
Prediction , diabetes , machine learning , gradient boosting , ROC curve