Title :
An Improved EM-Based Semi-supervised Learning Method
Author :
Fan, Xinghua ; Guo, Zhiyi ; Ma, Houfeng
Author_Institution :
Coll. of Comput. Sci. & Technol., Chongqing Univ. of Posts & Telecommun., Chongqing, China
Abstract :
The iterative process in the standard EM-based semi-supervised learning includes two steps: firstly, use the classifier constructed in previous iteration to classify all unlabeled samples; then, train a new classifier based on the reconstructed training set, which is composed of labeled samples and all unlabeled samples. There is a problem in the process of reconstructing the training set: unlabeled samples should be divided into two parts by the classifier, reliable and misclassified, the misclassified part is considered as training samples. This affects the finally classification performance and convergence efficiency. This paper presents an improved method, which reconstructs the labeled samples set and then considers it as the training set in each iterative process. The process is implemented by adding the reliable part in unlabeled samples to labeled samples and forming a new labeled samples set. Experimental results in text classification show that the proposed method improves the classification performance and convergence efficiency.
Keywords :
data handling; iterative methods; learning (artificial intelligence); pattern classification; classification performance; convergence efficiency; data reconstruction; iterative process; reconstructed training set; semisupervised learning method; unlabeled samples classification; Bioinformatics; Convergence; Electromagnetic compatibility; Intelligent systems; Iterative algorithms; Iterative methods; Machine learning; Semisupervised learning; Systems biology; Text categorization; Data reconstruction; EM algorithm; Semi-supervised learning; ensemble learning;
Conference_Titel :
Bioinformatics, Systems Biology and Intelligent Computing, 2009. IJCBS '09. International Joint Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-0-7695-3739-9
DOI :
10.1109/IJCBS.2009.27