DocumentCode :
495780
Title :
Flexible KNN Algorithm for Text Categorization by Authorship Based on Features of Lingual Conceptual Expression
Author :
Yunliang, Zhang ; Lijun, Zhu ; Xiaodong, Qiao ; Quan, Zhang
Author_Institution :
Inst. of Sci. & Tech. Inf. of China, Beijing, China
Volume :
2
fYear :
2009
fDate :
March 31 2009-April 2 2009
Firstpage :
601
Lastpage :
605
Abstract :
Text categorization by authorship is useful in some applications, and lingual conceptual expression is an effective representation for reducing the dimension of the vector space model (VSM). For this task, we use the KNN algorithm, a common, efficient, and effective text categorization algorithm. In the standard KNN algorithm, K is fixed for all texts being processed, and all neighbors are weighted equally. In this paper, we propose a flexible KNN algorithm that combines a k-variable algorithm with a weighting algorithm, which improves text categorization results.
Keywords :
computational linguistics; pattern classification; text analysis; authorship; flexible KNN algorithm; k-variable algorithm; lingual conceptual expression; text categorization effect; weighting algorithm; Acoustic applications; Acoustical engineering; Computer science; Content addressable storage; Frequency; Internet; Natural languages; Terminology; Testing; Text categorization;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Computer Science and Information Engineering, 2009 WRI World Congress on
Conference_Location :
Los Angeles, CA
Print_ISBN :
978-0-7695-3507-4
Type :
conf
DOI :
10.1109/CSIE.2009.363
Filename :
5171409