DocumentCode :
2984486
Title :
Handling Ambiguity via Input-Output Kernel Learning
Author :
Xinxing Xu ; Tsang, Ivor W. ; Dong Xu
Author_Institution :
Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
fYear :
2012
fDate :
10-13 Dec. 2012
Firstpage :
725
Lastpage :
734
Abstract :
Data ambiguities exist in many data mining and machine learning applications such as text categorization and image retrieval. For instance, it is generally beneficial to utilize the ambiguous unlabeled documents to learn a more robust classifier for text categorization under the semi-supervised learning setting. To handle general data ambiguities, we present a unified kernel learning framework named Input-Output Kernel Learning (IOKL). Based on our framework, we further propose a novel soft margin group sparse Multiple Kernel Learning (MKL) formulation by introducing a group kernel slack variable to each group of base input-output kernels. Moreover, an efficient block-wise coordinate descent algorithm with an analytical solution for the kernel combination coefficients is developed to solve the proposed formulation. We conduct comprehensive experiments on benchmark datasets for both semi-supervised learning and multiple instance learning tasks, and also apply our IOKL framework to a computer vision application called text-based image retrieval on the NUS-WIDE dataset. Promising results demonstrate the effectiveness of our proposed IOKL framework.
Keywords :
data mining; learning (artificial intelligence); operating system kernels; text analysis; NUS-WIDE dataset; ambiguous unlabeled documents; benchmark datasets; block wise coordinate descent algorithm; computer vision application; data mining; general data ambiguity; group kernel slack; input output kernel learning; kernel combination coefficients; kernel learning framework; machine learning application; multiple instance learning task; multiple kernel learning formulation; robust classifier; semisupervised learning setting; text based image retrieval; text categorization; Kernel; Linear programming; Semisupervised learning; Support vector machines; Training; Uncertainty; Vectors; Group Multiple Kernel Learning; Input-Output Kernel Learning; Multi-Instance Learning; Semi-supervised Learning; Text-based Image Retrieval;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining (ICDM), 2012 IEEE 12th International Conference on
Conference_Location :
Brussels
ISSN :
1550-4786
Print_ISBN :
978-1-4673-4649-8
Type :
conf
DOI :
10.1109/ICDM.2012.105
Filename :
6413856
Link To Document :
بازگشت