Title : 
Automatic Chinese Keyword Extraction Based on KNN for Implicit Subject Extraction
         
        
            Author : 
Qingguo, Zhang ; Chengzhi, Zhang
         
        
            Author_Institution : 
Tongfang Knowledge Network Technol. Co., Ltd., Beijing
         
        
            Abstract : 
This paper proposes a KNN-based method for automatic Chinese keyword extraction. First, the document is preprocessed using the vector space model. Second, a set of candidate keywords is constructed using the KNN method and a labeled dataset. Finally, the candidate keywords are post-processed according to the characteristics of keywords to meet readers' requirements. Experimental results show that the proposed method not only improves the precision and recall of keyword extraction but also extracts implicit subjects efficiently.
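The three steps named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the term weighting, the similarity measure, or the post-processing rules, so raw term frequency, cosine similarity, similarity-weighted voting, and a simple top-n cut are all assumptions made for illustration. The function names (`tf_vector`, `cosine`, `knn_candidates`, `post_process`) and the toy data are hypothetical, and the input is assumed to be already segmented into tokens.

```python
import math
from collections import Counter


def tf_vector(tokens):
    """Step 1 (assumed form): represent a document as a term-frequency vector."""
    return Counter(tokens)


def cosine(a, b):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(w * b[t] for t, w in a.items() if t in b)
    norm = math.sqrt(sum(w * w for w in a.values())) * math.sqrt(
        sum(w * w for w in b.values())
    )
    return dot / norm if norm else 0.0


def knn_candidates(tokens, labeled, k=2):
    """Step 2 (assumed form): pool the keywords of the k most similar labeled
    documents, weighting each keyword by the similarity of its source."""
    query = tf_vector(tokens)
    scored = sorted(
        ((cosine(query, tf_vector(doc)), kws) for doc, kws in labeled),
        key=lambda pair: pair[0],
        reverse=True,
    )
    votes = Counter()
    for sim, kws in scored[:k]:
        for kw in kws:
            votes[kw] += sim
    return votes


def post_process(votes, top_n=3, min_score=0.0):
    """Step 3 (placeholder rule): keep the top_n candidates above min_score,
    breaking ties alphabetically."""
    ranked = sorted(votes.items(), key=lambda pair: (-pair[1], pair[0]))
    return [kw for kw, score in ranked if score > min_score][:top_n]


# Toy labeled dataset: (segmented tokens, manually assigned keywords).
labeled = [
    (["stock", "market", "price", "rise"], ["finance", "stock market"]),
    (["football", "match", "goal"], ["sports"]),
    (["bank", "interest", "rate", "market"], ["finance", "banking"]),
]
tokens = ["market", "price", "interest"]
keywords = post_process(knn_candidates(tokens, labeled, k=2))
print(keywords)
```

In this toy run the top-ranked keyword is "finance", a term that never occurs in the input tokens. That is the sense in which a neighbor-based approach can surface an implicit subject: the keyword is inherited from similar labeled documents rather than read off the text itself.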
         
        
            Keywords : 
information retrieval; natural language processing; pattern classification; KNN method; automatic Chinese keyword extraction; implicit subject extraction; k-nearest neighbor; vector space model; Data mining; Data preprocessing; Euclidean distance; Frequency; Indexing; Information management; Knowledge acquisition; Learning systems; Statistics; Training data; KNN; Keyword Extraction
         
        
        
        
            Conference_Titel : 
Knowledge Acquisition and Modeling, 2008. KAM '08. International Symposium on
         
        
            Conference_Location : 
Wuhan
         
        
            Print_ISBN : 
978-0-7695-3488-6
         
        
        
            DOI : 
10.1109/KAM.2008.87