Title :
Multiclass Semi-Supervised Boosting Using Similarity Learning
Author :
Tanha, Jafar ; Saberian, Mohammad Javad ; van Someren, Maarten
Author_Institution :
Inf. Inst., Univ. of Amsterdam, Amsterdam, Netherlands
Abstract :
In this paper, we consider the multiclass semi-supervised classification problem. A boosting algorithm is proposed to solve the multiclass problem directly. The proposed approach uses a new multiclass loss function with two terms. The first term is the cost of the multiclass margin, and the second is a regularization term on unlabeled data. The regularization term is used to minimize the inconsistency between the pairwise similarity and the classifier predictions. It assigns soft labels weighted by the similarity between unlabeled and labeled examples. We then derive a boosting algorithm, named CD-MSSBoost, from the proposed loss function using coordinate gradient descent. The derived algorithm is further used to learn an optimal similarity function for the given data. Our experiments on a number of UCI datasets show that CD-MSSBoost outperforms state-of-the-art methods for multiclass semi-supervised learning.
Keywords :
learning (artificial intelligence); pattern classification; CD-MSSBoost; UCI datasets; coordinate gradient descent; multiclass loss function; multiclass margin; multiclass semisupervised boosting algorithm; multiclass semisupervised classification problem; multiclass semisupervised learning; optimal similarity function learning; regularization term; unlabeled data; Algorithm design and analysis; Boosting; Optimization; Prediction algorithms; Semisupervised learning; Training; Boosting; Multiclass classification; Semi-Supervised Learning; Similarity learning;
Conference_Title :
Data Mining (ICDM), 2013 IEEE 13th International Conference on
Conference_Location :
Dallas, TX
DOI :
10.1109/ICDM.2013.108