Title :
A Clustering-Based Approach to Analyse Examinations for Diabetic Patients
Author :
Bruno, Giulia ; Cerquitelli, Tania ; Chiusano, Silvia ; Xin Xiao
Author_Institution :
Dipt. di Ing. Gestionale e della Produzione, Politec. di Torino, Turin, Italy
Abstract :
Health care data collections are usually characterized by an inherent sparseness due to a large cardinality of patient records and a variety of medical treatments usually adopted for a given pathology. Innovative data analytics approaches are needed to effectively extract interesting knowledge from these large collections. This paper presents an explorative data mining approach, based on a density-based clustering algorithm, to identify the examinations commonly followed by patients with a given disease. To cluster patients undergoing similar medical treatments and sharing common patient profiles (i.e., Patient age and gender) a novel combined distance measure has been proposed. Furthermore, to focus on different dataset portions and locally identify groups of patients, the clustering algorithm has been exploited in a multiple-level fashion. Based on this cluster set, a classification model has been created to characterize the content of clusters and measure the effectiveness of the clustering process. The experiments, performed on a real diabetic patient dataset, demonstrate the effectiveness of the proposed approach in discovering interesting groups of patients with a similar examination history and with increasing disease severity.
Keywords :
data acquisition; data analysis; data mining; diseases; health care; medical computing; patient treatment; pattern classification; pattern clustering; classification model; cluster set; density-based clustering algorithm; diabetic patient dataset; diabetic patients; disease severity; examination analysis; explorative data mining approach; health care data collection; innovative data analytics; medical treatments; multiple-level fashion; pathology; patient age; patient gender; patient profiles; patient records; Clustering algorithms; Data mining; Decision trees; Diabetes; Diseases; History; Vectors; classification; cluster analysis; diabetes; patient examination history;
Conference_Titel :
Healthcare Informatics (ICHI), 2014 IEEE International Conference on
DOI :
10.1109/ICHI.2014.14