DocumentCode :
3099470
Title :
An improved KNN algorithm for text classification
Author :
Wang, Lingzhong ; Li, Xia
Author_Institution :
Coll. of Inf. Eng., North China Univ. of Technol., Beijing, China
Volume :
2
fYear :
2010
fDate :
18-19 Oct. 2010
Abstract :
This paper analyzes the advantages and disadvantages of KNN alogrithm and introduces an improved KNN alogrithm (WPSOKN) for text classification. It is based on particle swarm optimization which has the ability of random and directed global search within training document set. During the procedure for searching k nearest neighbors of the test sample, those document vectors that are impossible to be the k closest vectors are kicked out quickly. Besides it reduces the impact of individual particles from the overall. Moreover, the interference factor is introduced to avoid premature to find the k nearest neighbors of test samples quickly. We conducted an extensive experimental study using real datasets, and the results show that the WPSOKNN algorithm is more efficient than other KNN algorithm.
Keywords :
particle swarm optimisation; pattern classification; search problems; text analysis; directed global search; improved k nearest neighbor algorithm; particle swarm optimization; text classification; Algorithm design and analysis; Artificial neural networks; Classification algorithms; Computers; Machine learning; Machine learning algorithms; Optimization; KNN; Particle Swarm Optimization; Text Classification; WPSOKNN;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Networking and Automation (ICINA), 2010 International Conference on
Conference_Location :
Kunming
Print_ISBN :
978-1-4244-8104-0
Electronic_ISBN :
978-1-4244-8106-4
Type :
conf
DOI :
10.1109/ICINA.2010.5636476
Filename :
5636476
Link To Document :
بازگشت