Title :
Semi-supervised classification algorithm based on the KNN
Author :
Chang, Yawei ; Liu, Houquan
Author_Institution :
Sch. of Comput. Sci. & Technol., China Univ. of Min. & Technol., Xuzhou, China
Abstract :
KNN algorithm is a classification algorithm based on examples. For a test documentation, we need to calculate the similarity with each text of the training sample focus, the computation complexity is very high. According to this problem, this paper puts forward a method based on the EM-KNN semi-supervised classification algorithm. Firstly, the algorithm to cluster the training set, calculate the center of each category, then combine the center of each category and the clustering the text to form new training set. Finally train the new training set with classical KNN algorithm. Experimental results show that computational complexity can be reduced largely and the performance of the classifier can be improved by this algorithm.
Keywords :
computational complexity; expectation-maximisation algorithm; pattern classification; text analysis; EM-KNN semisupervised classification algorithm; computation complexity; test documentation; text similarity; Classification algorithms; Clustering algorithms; Programming; Classification; Clustering; EM; KNN; Semi-supervised;
Conference_Titel :
Communication Software and Networks (ICCSN), 2011 IEEE 3rd International Conference on
Conference_Location :
Xi´an
Print_ISBN :
978-1-61284-485-5
DOI :
10.1109/ICCSN.2011.6014376