Title :
Semi-supervised Learning Applied to Large Data Sets with Very Few Labeled Examples
Author :
Chen, Hong ; Guo, Gongde
Author_Institution :
Sch. of Math. & Comput. Sci., Fujian Normal Univ., Fuzhou, China
Abstract :
A semi-supervised classification approach, SS-LFL, is proposed. In SS-LFL, several weak binary classifiers, each of which identifies instances of one particular class, are first trained on the labeled data, and the whole data set is then clustered into partitions until they are sufficiently tight and pure. SS-LFL alternates between assigning "imperfect-classes" to the unlabeled data in these partitions and constructing the next weak binary classifiers from both the labeled and the "imperfect" data. It works well on large data sets with very few labeled examples; moreover, it requires neither known parametric distributions of the data nor the participation of an expert. Experiments on public data sets from the UCI machine learning repository show that SS-LFL is a promising method.
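The loop described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the weak learner, the clustering method, or the tightness and purity criteria, so everything below is an assumption made for illustration. The weak binary classifiers are one-vs-rest nearest-centroid scorers, the partitions come from recursive 2-means bisection, and the names `ss_lfl_sketch`, `max_radius2`, and `min_purity` are hypothetical.

```python
from collections import Counter

def centroid(points):
    n = len(points)
    return tuple(sum(p[d] for p in points) / n for d in range(len(points[0])))

def dist2(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def train_binary(points, labels, cls):
    # Assumed weak learner: one-vs-rest centroid scorer (positive score = "is cls").
    pos = centroid([p for p, lab in zip(points, labels) if lab == cls])
    neg = centroid([p for p, lab in zip(points, labels) if lab != cls])
    return lambda x: dist2(x, neg) - dist2(x, pos)

def split(idx, X):
    # Assumed clustering step: 2-means seeded with two mutually distant points.
    a = X[idx[0]]
    b = max((X[i] for i in idx), key=lambda p: dist2(p, a))
    a = max((X[i] for i in idx), key=lambda p: dist2(p, b))
    for _ in range(10):
        left = [i for i in idx if dist2(X[i], a) <= dist2(X[i], b)]
        right = [i for i in idx if dist2(X[i], a) > dist2(X[i], b)]
        if not left or not right:
            break
        a = centroid([X[i] for i in left])
        b = centroid([X[i] for i in right])
    return left, right

def partition(idx, X, votes, max_radius2, min_purity):
    # Bisect until a partition is tight (small radius) and pure (one dominant vote).
    c = centroid([X[i] for i in idx])
    tight = max(dist2(X[i], c) for i in idx) <= max_radius2
    top = Counter(votes[i] for i in idx).most_common(1)[0][1]
    pure = top / len(idx) >= min_purity
    if (tight and pure) or len(idx) == 1:
        return [idx]
    left, right = split(idx, X)
    if not left or not right:
        return [idx]
    return (partition(left, X, votes, max_radius2, min_purity)
            + partition(right, X, votes, max_radius2, min_purity))

def ss_lfl_sketch(X, y, rounds=3, max_radius2=1.0, min_purity=0.9):
    # y[i] is a class label, or None for an unlabeled instance.
    # Requires at least two classes among the labeled instances.
    classes = sorted({c for c in y if c is not None})
    train_idx = [i for i, c in enumerate(y) if c is not None]
    current = list(y)  # true labels plus the current "imperfect" labels
    for _ in range(rounds):
        pts = [X[i] for i in train_idx]
        labs = [current[i] for i in train_idx]
        scorers = {c: train_binary(pts, labs, c) for c in classes}
        # Labeled instances keep their label; others take the best-scoring class.
        votes = [y[i] if y[i] is not None
                 else max(classes, key=lambda c: scorers[c](X[i]))
                 for i in range(len(X))]
        for part in partition(list(range(len(X))), X, votes, max_radius2, min_purity):
            majority = Counter(votes[i] for i in part).most_common(1)[0][0]
            for i in part:
                if y[i] is None:
                    current[i] = majority  # assign the partition's imperfect class
        train_idx = list(range(len(X)))  # next round trains on labeled + imperfect data
    return current

# Toy usage: two well-separated groups with one labeled example each.
X_demo = [(0, 0), (0.5, 0.2), (0.1, 0.6), (10, 10), (10.4, 9.8), (9.7, 10.3)]
y_demo = ["a", None, None, "b", None, None]
print(ss_lfl_sketch(X_demo, y_demo))
```

In this sketch the labeled instances never lose their original labels, so every class keeps at least one training example in each round; whether SS-LFL itself enforces this is not stated in the abstract.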
Keywords :
learning (artificial intelligence); pattern classification; pattern clustering; UCI machine learning repository; binary classifiers; semisupervised classification approach; semisupervised learning; Application software; Computer science; Content based retrieval; Fuzzy systems; Humans; Image retrieval; Information retrieval; Machine learning; Mathematics; Semisupervised learning; Large Data Sets; Semi-Supervised Learning; Very Few Labeled Examples;
Conference_Title :
Fuzzy Systems and Knowledge Discovery, 2009. FSKD '09. Sixth International Conference on
Conference_Location :
Tianjin
Print_ISBN :
978-0-7695-3735-1
DOI :
10.1109/FSKD.2009.196