DocumentCode :
2717958
Title :
Robust and discriminative distance for Multi-Instance Learning
Author :
Wang, Hua ; Nie, Feiping ; Huang, Heng
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of Texas at Arlington, Arlington, TX, USA
fYear :
2012
fDate :
16-21 June 2012
Firstpage :
2919
Lastpage :
2924
Abstract :
Multi-Instance Learning (MIL) is an emerging topic in machine learning, which has broad applications in computer vision. For example, by considering video classification as a MIL problem where we only need labeled video clips (such as tagged online videos) but not labeled video frames, one can lower down the labeling cost, which is typically very expensive. We propose a novel class specific distance Metrics enhanced Class-to-Bag distance (M-C2B) method to learn a robust and discriminative distance for multi-instance data, which employs the not-squared ℓ2-norm distance to address the most difficult challenge in MIL, i.e., the outlier instances that abound in multi-instance data by nature. As a result, the formulated objective ends up to be a simultaneous ℓ2, 1-norm minimization and maximization (minmax) problem, which is very hard to solve in general due to the non-smoothness of the ℓ2, 1-norm. We thus present an efficient iterative algorithm to solve the general ℓ2, 1-norm minmax problem with rigorously proved convergence. To the best of our knowledge, we are the first to solve a general ℓ2, 1-norm minmax problem in literature. We have conducted extensive experiments to evaluate various aspects of the proposed method, in which promising results validate our new method in cost-effective video classification.
Keywords :
computer vision; image classification; iterative methods; learning (artificial intelligence); minimax techniques; minimisation; video signal processing; ℓ2-1-norm maximization problem; ℓ2-1-norm minimizationproblem; ℓ2-1-norm minmax problem; ℓ2-1-norm smoothness; M-C2B; MIL; class specific distance metrics enhanced class-to-bag distance method; computer vision; iterative algorithm; machine learning; multiinstance data; multiinstance learning; not-squared ℓ2-norm distance; robust distance; video classification; Algorithm design and analysis; Iterative methods; Labeling; Machine learning; Measurement; Robustness; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on
Conference_Location :
Providence, RI
ISSN :
1063-6919
Print_ISBN :
978-1-4673-1226-4
Electronic_ISBN :
1063-6919
Type :
conf
DOI :
10.1109/CVPR.2012.6248019
Filename :
6248019
Link To Document :
بازگشت