DocumentCode :
2636691
Title :
FAVC: Clustering Categorical Data Using the Frequency of Attribute Values Combinations
Author :
Do, Hee-Jung ; Kim, Jae-Yearn
Author_Institution :
Dept. of Ind. Eng., Hanyang Univ., Seoul
fYear :
2008
fDate :
18-20 June 2008
Firstpage :
304
Lastpage :
304
Abstract :
This paper proposes a new clustering algorithm for categorical data based on the frequency of attribute values combinations (FAVC). This algorithm finds all the combinations of attribute values in a record (which represent a subset of all the attribute values), and then groups the records using the frequency of these combinations. As the FAVC algorithm considers all the subsets of attribute values in a record, records in a cluster have not only similar attribute value sets but also strongly associated attribute values. The FAVC algorithm evaluated with real and synthetic data sets. The FAVC is shown better clustering results and superior running time in comparison with that of COOLCAT.
Keywords :
category theory; data handling; pattern clustering; COOLCAT; attribute value combination frequency; categorical data clustering; clustering algorithm; Clustering algorithms; Entropy; Euclidean distance; Frequency; Industrial engineering; Robustness;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Innovative Computing Information and Control, 2008. ICICIC '08. 3rd International Conference on
Conference_Location :
Dalian, Liaoning
Print_ISBN :
978-0-7695-3161-8
Electronic_ISBN :
978-0-7695-3161-8
Type :
conf
DOI :
10.1109/ICICIC.2008.275
Filename :
4603493
Link To Document :
بازگشت