Title :
Constructing a Fast Algorithm for Multi-label Classification with Support Vector Data Description
Author_Institution :
Sch. of Comput. Sci. & Technol., Nanjing Normal Univ., Nanjing, China
Abstract :
For multi-label classification, problem transform algorithms have received more attention due to their good performance and low computational complexity. But how to speed up training and test procedures is still a challenging issue. In this paper, one-by-one data decomposition trick is adopted to divide a k-label problem into k sub-problems, where a specific sub-problem only consists of instances with a specific class. We train each sub-classifier using support vector data description that learns a smallest hyper-sphere to capture the majority of training instances of each class, and integrate k sub-classifiers into an entire multi-label classification algorithm using both pseudo posterior probabilities and linear ridge regression. Our new method has the lowest time complexity, compared with existing problem transform support vector machines for multi-label classification. Experimental results on the Yeast dataset illustrate that our algorithm works better than several state-of-the-art ones.
Keywords :
data mining; probability; regression analysis; support vector machines; k-label problem; linear ridge regression; multilabel classification; one-by-one data decomposition trick; problem transform algorithm; pseudo posterior probability; support vector data description; Classification algorithms; Kernel; Loss measurement; Support vector machine classification; Training; Transforms; classification; decomposition; kernel; multi-label; support vector data description;
Conference_Titel :
Granular Computing (GrC), 2010 IEEE International Conference on
Conference_Location :
San Jose, CA
Print_ISBN :
978-1-4244-7964-1
DOI :
10.1109/GrC.2010.107