Title :
Flexible KNN Algorithm for Text Categorization by Authorship Based on Features of Lingual Conceptual Expression
Author :
Yunliang, Zhang ; Lijun, Zhu ; Xiaodong, Qiao ; Quan, Zhang
Author_Institution :
Inst. of Sci. & Tech. Inf. of China, Beijing, China
fDate :
March 31 2009-April 2 2009
Abstract :
Text categorization by authorship is useful in some applications and lingual conceptual expression is an effective expression to reduce the dimension of the VSM. In this application, we use KNN algorithm, which is a common, efficient and effective text categorization algorithm. In standard KNN algorithm, the K is fixed for different processing texts, and the weights for neighbors are equal. In this paper, a flexible KNN algorithm is combined with k-variable algorithm and weighting algorithm, which improves the effect of text categorization.
Keywords :
computational linguistics; pattern classification; text analysis; authorship; flexible KNN algorithm; k-variable algorithm; lingual conceptual expression; text categorization effect; weighting algorithm; Acoustic applications; Acoustical engineering; Computer science; Content addressable storage; Frequency; Internet; Natural languages; Terminology; Testing; Text categorization;
Conference_Titel :
Computer Science and Information Engineering, 2009 WRI World Congress on
Conference_Location :
Los Angeles, CA
Print_ISBN :
978-0-7695-3507-4
DOI :
10.1109/CSIE.2009.363