• DocumentCode
    2796404
  • Title

    Building a concept hierarchy automatically and its measuring

  • Author

    Kuo, Huang-Cheng ; Lai, Hung-Chung ; Huang, Jen-Peng

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Nat. Chiayi Univ., Chiayi
  • Volume
    7
  • fYear
    2008
  • fDate
    12-15 July 2008
  • Firstpage
    3975
  • Lastpage
    3978
  • Abstract
    Concept hierarchies are important for generalization in many data mining applications. Abundant algorithms have been proposed for automatic construction of concept hierarchy. A typical application of such algorithms is constructing directories for documents in information retrieval community. However, the research result can not be directly adopted for automatic construction of concept hierarchies for objects with identifiers only, such as items in market basket database where items have no attribute and only similarities between items are available. So, the metrics for directories for documents are not suitable for hierarchies for identifier-only data. In this paper, we propose a measurement that considers the unevenness of similarities among objects in the child nodes. We use the unevenness value to express the balance of concept hierarchies. For constructing a concept hierarchy, we propose a hierarchical clustering with join/merge decision (HCJMD) which is modified from hierarchical agglomerative clustering (HAC).
  • Keywords
    data mining; information retrieval; pattern clustering; concept hierarchy; data mining; hierarchical agglomerative clustering; hierarchical clustering with join-merge decision; information retrieval; unevenness value; Application software; Computer science; Conference management; Cybernetics; Data engineering; Data mining; Frequency; Information management; Information retrieval; Machine learning; Concept hierarchy; data mining; hierarchical agglomerative clustering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics, 2008 International Conference on
  • Conference_Location
    Kunming
  • Print_ISBN
    978-1-4244-2095-7
  • Electronic_ISBN
    978-1-4244-2096-4
  • Type

    conf

  • DOI
    10.1109/ICMLC.2008.4621097
  • Filename
    4621097