DocumentCode :
2780548
Title :
Particle swarm optimization based semi-supervised learning on Chinese text categorization
Author :
Shi Cheng ; Yuhui Shi ; Quande Qin
Author_Institution :
Dept. of Electr. Eng. & Electron., Univ. of Liverpool, Liverpool, UK
fYear :
2012
fDate :
10-15 June 2012
Firstpage :
1
Lastpage :
8
Abstract :
For many large scale learning problems, acquiring a large amount of labeled training data is expensive and time-consuming. Semi-supervised learning is a machine learning paradigm which deals with utilizing unlabeled data to build better classifiers. However, unlabeled data with wrong predictions will mislead the classifier. In this paper, we proposed a particle swarm optimization based semi-learning classifier to solve Chinese text categorization problem. This classifier utilizes an iterative strategy, and the result of classifier is determined by a document´s previous prediction and its neighbors´ information. The new classifier is tested on a Chinese text corpus. The proposed classifier is compared with the k nearest neighbor method, the k weighted nearest neighbor method, and the self-learning classifier.
Keywords :
classification; iterative methods; learning (artificial intelligence); natural language processing; particle swarm optimisation; text analysis; Chinese text categorization; Chinese text corpus; document prediction; iterative strategy; labeled training data; large scale learning problem; machine learning paradigm; particle swarm optimization; semilearning classifier; semisupervised learning; unlabeled data; Equations; Error analysis; Mathematical model; Particle swarm optimization; Text categorization; Training; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Evolutionary Computation (CEC), 2012 IEEE Congress on
Conference_Location :
Brisbane, QLD
Print_ISBN :
978-1-4673-1510-4
Electronic_ISBN :
978-1-4673-1508-1
Type :
conf
DOI :
10.1109/CEC.2012.6252959
Filename :
6252959
Link To Document :
بازگشت