Title of article :
Comparison of classification techniques based on medical datasets
Author/Authors :
Abdulhussein Al-Joda, Alyaa Al-Furat Al-Awsat Technical University(ATU) - Al-Najaf, Iraq , Fadhil Abdullah, Enas Faculty of Education for Girls - University of Kufa - Al- Najaf, Iraq , Alasadi, Suad A University of Babylon - Babil, Iraq
Pages :
8
From page :
1957
To page :
1964
Abstract :
Medical data mining has been a widespread data mining area of late. Mainly, diagnosing cancers is one of the most important topics that many researchers studied to develop intelligent decision support systems to help doctors. In this research, three different classifiers are used to improve the performance in terms of accuracy. The classifiers are Support Vector Machine (SVM), Adaptive Boosting (AdaBoost), and Random forests (RF). Two machine learning repository datasets are used to evaluate and verify the classification methods. Classifiers are trained using the 10-fold crossvalidation strategy, which splits the original sample into training and testing sets. In order to assess classifier efficiency, accuracy (AC), precision, recall, specificity, F1, and area under the curve are used (AUC). The Experiments showed that the AdaBoost classifier’s achieved an accuracy of 100% which is superior in both datasets in comparison with SVM and RF with AC of 97%. The accuracy is also compared with another study from the previous work that uses the same datasets, and the results demonstrated that the current research has better accuracy than the other study.
Keywords :
Classifier , AdaBoost , SVM , RF , ROC , Breast Cancer
Journal title :
International Journal of Nonlinear Analysis and Applications
Serial Year :
2021
Record number :
2703236
Link To Document :
بازگشت