Title :
Data Mining in Bone Marrow Transplant Records to Identify Patients With High Odds of Survival
Author :
Taati, Babak ; Snoek, Jasper ; Aleman, Dionne ; Ghavamzadeh, Ardeshir
Author_Institution :
Toronto Rehabilitation Inst., Univ. Health Network, Toronto, ON, Canada
Abstract :
Patients undergoing a bone marrow stem cell transplant (BMT) face various risk factors. Analyzing data from past transplants could enhance the understanding of the factors influencing success. Records up to 120 measurements per transplant procedure from 1751 patients undergoing BMT were collected (Shariati Hospital). Collaborative filtering techniques allowed the processing of highly sparse records with 22.3% missing values. Ten-fold cross-validation was used to evaluate the performance of various classification algorithms trained on predicting the survival status. Modest accuracy levels were obtained in predicting the survival status (AUC = 0.69). More importantly, however, operations that had the highest chances of success were shown to be identifiable with high accuracy, e.g., 92% or 97% when identifying 74 or 31 recipients, respectively. Identifying the patients with the highest chances of survival has direct application in the prioritization of resources and in donor matching. For patients where high-confidence prediction is not achieved, assigning a probability to their survival odds has potential applications in probabilistic decision support systems and in combination with other sources of information.
Keywords :
collaborative filtering; data mining; medical expert systems; medical information systems; patient treatment; pattern classification; BMT success; Shariati Hospital; bone marrow stem cell transplant; bone marrow transplant records; classification algorithm training; collaborative filtering techniques; data mining; donor matching; highly sparse data processing; past transplant data; patient risk factors; patient survival rate; probabilistic decision support systems; resource prioritization; survival status prediction; Bayes methods; Collaboration; Matrix decomposition; Optimization; Probabilistic logic; Robustness; Sparse matrices; Bone marrow transplant; collaborative filtering; data mining; donor matching; health records; matrix factorization; principal component analysis; recommender systems; survival;
Journal_Title :
Biomedical and Health Informatics, IEEE Journal of
DOI :
10.1109/JBHI.2013.2274733