• DocumentCode
    463399
  • Title

    A Novel Approch for Clustering of Chinese Text Based on Concept Hierarchy

  • Author

    Peng, Zhao ; Huan-tong, Geng ; Qing-sheng, Cai

  • Author_Institution
    Dept. of Comput. Sci. & Technol., Univ. of Sci. & Technol. of China
  • Volume
    1
  • fYear
    2006
  • fDate
    17-19 July 2006
  • Firstpage
    607
  • Lastpage
    611
  • Abstract
    After analyzing the disadvantages of traditional text clustering method based on keywords set, a novel approach for clustering of Chinese text based on concept hierarchy is presented. It introduces a Chinese topic classify dictionary as background knowledge to clustering of Chinese text. It adopts a hierarchical coding system which reflects concept relevance among different words and uses vector space model based on concept hierarchy to represent Chinese text. The experimental results show this approach is more effective than traditional text clustering method based on keywords set
  • Keywords
    dictionaries; natural language processing; pattern clustering; text analysis; vectors; Chinese text clustering; Chinese topic classify dictionary; concept hierarchy; hierarchical coding system; vector space model; Clustering algorithms; Clustering methods; Computer science; Dictionaries; Documentation; Frequency; Signal analysis; Signal processing; Space technology; Text mining; Chinese topic classify dictionary; Text Clustering; Vector Space Model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cognitive Informatics, 2006. ICCI 2006. 5th IEEE International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    1-4244-0475-4
  • Type

    conf

  • DOI
    10.1109/COGINF.2006.365554
  • Filename
    4216471