• DocumentCode
    2127353
  • Title
    Automatic Chinese Keyword Extraction Based on KNN for Implicit Subject Extraction
  • Author
    Qingguo, Zhang ; Chengzhi, Zhang
  • Author_Institution
    Tongfang Knowledge Network Technol. Co., Ltd., Beijing
  • fYear
    2008
  • fDate
    21-22 Dec. 2008
  • Firstpage
    689
  • Lastpage
    692
  • Abstract
    In this paper, a method of automatic Chinese keyword extraction based on KNN is proposed. First, it preprocesses the document using the vector space model. Second, it constructs a set of candidate keywords based on the KNN method and a labeled dataset. Finally, it post-processes the candidate keywords according to the characteristics of keywords to meet readers' requirements. Experimental results show that the proposed method not only improves the precision and recall of keyword extraction but also extracts implicit subjects efficiently.
  • Keywords
    information retrieval; natural language processing; pattern classification; KNN method; automatic Chinese keyword extraction; implicit subject extraction; k-nearest neighbor; vector space model; Data mining; Data preprocessing; Euclidean distance; Frequency; Indexing; Information management; Knowledge acquisition; Learning systems; Statistics; Training data; Implicit Subject Extraction; KNN; Keyword Extraction;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Knowledge Acquisition and Modeling, 2008. KAM '08. International Symposium on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-0-7695-3488-6
  • Type
    conf
  • DOI
    10.1109/KAM.2008.87
  • Filename
    4732916
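
The pipeline outlined in the abstract (vector space representation, then KNN over a labeled dataset to propose candidate keywords) can be illustrated with a minimal sketch. This is not the authors' implementation. The raw term-frequency weighting, the cosine similarity measure, the similarity-weighted voting, the function names, and the toy data are all assumptions made for illustration. The post-processing step is omitted, and the input is assumed to be already word-segmented, which Chinese text would require.

```python
import math
from collections import Counter


def tf_vector(tokens):
    """Represent a tokenized document as a sparse term-frequency vector."""
    return Counter(tokens)


def cosine(a, b):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def knn_candidate_keywords(tokens, labeled_docs, k=2, top_n=3):
    """Propose keywords from the k most similar labeled documents.

    labeled_docs is a list of (tokens, keywords) pairs (hypothetical format).
    Each neighbor votes for its keywords, weighted by its similarity.
    """
    query = tf_vector(tokens)
    scored = [(cosine(query, tf_vector(doc)), kws) for doc, kws in labeled_docs]
    neighbors = sorted(scored, key=lambda pair: pair[0], reverse=True)[:k]
    votes = Counter()
    for sim, kws in neighbors:
        for kw in kws:
            votes[kw] += sim
    return [kw for kw, _ in votes.most_common(top_n)]


# Toy labeled dataset (invented for illustration only).
labeled = [
    (["stock", "market", "price", "rise"], ["finance", "stock market"]),
    (["stock", "price", "fall", "investor"], ["finance", "investment"]),
    (["football", "match", "goal"], ["sports"]),
]

candidates = knn_candidate_keywords(
    ["stock", "price", "investor", "market"], labeled, k=2, top_n=3
)
print(candidates)
```

Because the candidates come from the neighbors' labels rather than from the document's own words, a keyword such as "finance" can be proposed even though it never occurs in the query text. This is one plausible reading of what the abstract calls implicit subject extraction.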