Title :
Building a concept hierarchy automatically and its measuring
Author :
Kuo, Huang-Cheng ; Lai, Hung-Chung ; Huang, Jen-Peng
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Chiayi Univ., Chiayi
Abstract :
Concept hierarchies are important for generalization in many data mining applications. Abundant algorithms have been proposed for automatic construction of concept hierarchy. A typical application of such algorithms is constructing directories for documents in information retrieval community. However, the research result can not be directly adopted for automatic construction of concept hierarchies for objects with identifiers only, such as items in market basket database where items have no attribute and only similarities between items are available. So, the metrics for directories for documents are not suitable for hierarchies for identifier-only data. In this paper, we propose a measurement that considers the unevenness of similarities among objects in the child nodes. We use the unevenness value to express the balance of concept hierarchies. For constructing a concept hierarchy, we propose a hierarchical clustering with join/merge decision (HCJMD) which is modified from hierarchical agglomerative clustering (HAC).
Keywords :
data mining; information retrieval; pattern clustering; concept hierarchy; data mining; hierarchical agglomerative clustering; hierarchical clustering with join-merge decision; information retrieval; unevenness value; Application software; Computer science; Conference management; Cybernetics; Data engineering; Data mining; Frequency; Information management; Information retrieval; Machine learning; Concept hierarchy; data mining; hierarchical agglomerative clustering;
Conference_Titel :
Machine Learning and Cybernetics, 2008 International Conference on
Conference_Location :
Kunming
Print_ISBN :
978-1-4244-2095-7
Electronic_ISBN :
978-1-4244-2096-4
DOI :
10.1109/ICMLC.2008.4621097