Title of article :
Prediction of Breast Cancer using Machine Learning Approaches
Author/Authors :
Rabiei ، Reza Department of Health Information Technology and Management - School of Allied Medical Sciences - Shahid Beheshti University of Medical Sciences , Ayyoubzadeh ، Mohammad Department of Health Information Technology and Management - School of Allied Medical Sciences - Tehran University of Medical Science , Sohrabei ، Solmaz Department Deputy of Development, Management and Resources - Office of Statistic and Information Technology Management - Zanjan University of Medical Sciences , Esmaeili ، Marzieh Department of Health Information Technology and Management - School of Allied Medical Sciences - Tehran University of Medical Science , Atashi ، Alireza Department of E-Health - Virtual School - Tehran University of Medical Sciences
From page :
297
To page :
308
Abstract :
Background: Breast cancer is considered one of the most common cancers in women caused by various clinical, lifestyle, social, and economic factors. Machine learning has the potential to predict breast cancer based on features hidden in data. Objective: This study aimed to predict breast cancer using different machinelearning approaches applying demographic, laboratory, and mammographic data. Material and Methods: In this analytical study, the database, including 5,178 independent records, 25% of which belonged to breast cancer patients with 24 attributes in each record was obtained from Motamed cancer institute (ACECR), Tehran, Iran. The database contained 5,178 independent records, 25% of which belonged to breast cancer patients containing 24 attributes in each record. The random forest (RF), neural network (MLP), gradient boosting trees (GBT), and genetic algorithms (GA) were used in this study. Models were initially trained with demographic and laboratory features (20 features). The models were then trained with all demographic, laboratory, and mammographic features (24 features) to measure the effectiveness of mammography features in predicting breast cancer. Results: RF presented higher performance compared to other techniques (accuracy 80%, sensitivity 95%, specificity 80%, and the area under the curve (AUC) 0.56). Gradient boosting (AUC=0.59) showed a stronger performance compared to the neural network. Conclusion: Combining multiple risk factors in modeling for breast cancer prediction could help the early diagnosis of the disease with necessary care plans. Collection, storage, and management of different data and intelligent systems based on multiple factors for predicting breast cancer are effective in disease management.
Keywords :
Artificial Intelligence , Breast cancer , computing methodologies , genetic algorithm , Machine Learning
Journal title :
Journal of Biomedical Physics and Engineering
Journal title :
Journal of Biomedical Physics and Engineering
Record number :
2706984
Link To Document :
بازگشت