Title :
Clustering medical data to predict the likelihood of diseases
Author :
Paul, Razan ; Hoque, Abu Sayed Md Latiful
Author_Institution :
Dept. of Comput. Sci. & Eng., Bangladesh Univ. of Eng. & Technol., Dhaka, Bangladesh
Abstract :
Several studies show that background knowledge of a domain can improve the results of clustering algorithms. In this paper, we illustrate how to use background knowledge of the medical domain in the clustering process to predict the likelihood of diseases. To find the likelihood of diseases, clustering has to be based on the anticipated likelihood attributes together with the core attributes of the disease in each data point. For this purpose, we propose a constraint k-Means-Mode clustering algorithm. Medical data contain both continuous and categorical attributes, and the developed algorithm handles both kinds of data while clustering on the anticipated likelihood attributes and the core disease attributes. We demonstrate its effectiveness by testing it on a real-world patient data set.
Keywords :
diseases; medical administrative data processing; pattern clustering; constraint k-means-mode clustering algorithm; diseases likelihood prediction; medical data clustering; Accuracy; Boolean functions; Clustering algorithms; Dictionaries; Diseases; Medical diagnostic imaging; Prediction algorithms;
Conference_Title :
Digital Information Management (ICDIM), 2010 Fifth International Conference on
Conference_Location :
Thunder Bay, ON
Print_ISBN :
978-1-4244-7572-8
DOI :
10.1109/ICDIM.2010.5664638