Title :
Decision tree discovery for the diagnosis of type II diabetes
Author :
Al Jarullah, Asma A
Author_Institution :
Dept. of Inf. Syst., King Saud Univ., Riyadh, Saudi Arabia
Abstract :
The discovery of knowledge from medical databases is important in order to make effective medical diagnosis. The aim of data mining is to extract knowledge from information stored in database and generate clear and understandable description of patterns. In this study, decision tree method was used to predict patients with developing diabetes. The dataset used is the Pima Indians Diabetes Data Set, which collects the information of patients with and without developing diabetes. The study goes through two phases. The first phase is data preprocessing including attribute identification and selection, handling missing values, and numerical discretization. The second phase is a diabetes prediction model construction using the decision tree method. Weka software was used throughout all the phases of this study.
Keywords :
data mining; decision trees; medical computing; patient diagnosis; Pima Indians diabetes data set; Weka software; attribute identification; attribute selection; decision tree discovery; knowledge discovery; medical databases; medical diagnosis; numerical discretization; type II diabetes diagnosis; Data mining; Databases; Decision trees; Diabetes; Medical diagnostic imaging; Predictive models; Sugar; Decision Tree; data mining; diabetes; diagnosis;
Conference_Titel :
Innovations in Information Technology (IIT), 2011 International Conference on
Conference_Location :
Abu Dhabi
Print_ISBN :
978-1-4577-0311-9
DOI :
10.1109/INNOVATIONS.2011.5893838