DocumentCode
2007284
Title
Extraction of Meaningful Rules in a Medical Database
Author
Suh, Sang C. ; Saffer, Sam ; Adla, Naveen Kumar
Author_Institution
Dept. of Comput. Sci., Texas A & M Univ.- Commerce, Commerce, CA, USA
fYear
2008
fDate
11-13 Dec. 2008
Firstpage
450
Lastpage
456
Abstract
Clustering enhances the value of existing databases by revealing rules in the data. These rules are useful for understanding trends, making predictions of future events from historical data, or synthesizing data records into meaningful clusters. Through clustering are similar data items grouped together to form clusters. Clustering algorithms usually employ a distance metric based (e.g., Euclidean) similarity measure in order to partition the database such that data points in the same partition are more similar than points in different partitions. In this paper, we study clustering algorithms for data with categorical attributes. Instead of using traditional clustering algorithms that use distances between points for clustering which is not an appropriate concept for Boolean and categorical attributes, we propose a novel concept of HAC (hierarchy of attributes and concepts) to measure the similarity/proximity between a pair of data points. In this study, HAC will be used as an aid to represent medical domain knowledge substructures to simplify the generation process of the databases through clustering. As a result, the research will identify interesting relationships and patterns among the data, and represent them in the form of association rules.
Keywords
data mining; medical information systems; association rules; categorical attributes; clustering algorithm; distance metric; medical database; Application software; Association rules; Business; Clustering algorithms; Computer science; Data analysis; Data mining; Databases; Machine learning; Partitioning algorithms; Association Rules; Hierarchical Clustering; Medical Databases;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Applications, 2008. ICMLA '08. Seventh International Conference on
Conference_Location
San Diego, CA
Print_ISBN
978-0-7695-3495-4
Type
conf
DOI
10.1109/ICMLA.2008.123
Filename
4725012
Link To Document