Title :
High Reliable Multi-View Semi-Supervised Learning with Extremely Sparse Labeled Data
Author_Institution :
Dept. of Comput. Sci. & Technol., East China Normal Univ., Shanghai
Abstract :
Most semi-supervised learning methods assume there are a number of labeled data available in order to learn a classifier which then exploits a large set of unlabeled data. However, for some applications, there are only extremely spare labeled examples attainable (say, one example per category). In this case, these semi-supervised learning methods can not work. In this paper, a new method for seeking more examples with high reliable labels based on the limited labeled data is proposed. By investigating the correlation between different views through canonical correlation analysis, our method can launch semi-supervised learning using only one labeled example from each class. Experiments on text classification show the effectiveness of the proposed method.
Keywords :
learning (artificial intelligence); pattern classification; text analysis; canonical correlation analysis; extremely sparse labeled data; multiview semisupervised learning; text classification; Application software; Computer science; Content based retrieval; Hybrid intelligent systems; Image retrieval; Machine learning; Negative feedback; Semisupervised learning; Sun; Text categorization; Semi-supervised learning; canonical correlation analysis; co-training;
Conference_Titel :
Hybrid Intelligent Systems, 2008. HIS '08. Eighth International Conference on
Conference_Location :
Barcelona
Print_ISBN :
978-0-7695-3326-1
Electronic_ISBN :
978-0-7695-3326-1
DOI :
10.1109/HIS.2008.12