مرکز منطقه ای اطلاع رساني علوم و فناوري - Matrix Completion for Weakly-Supervised Multi-Label Image Classification

DocumentCode :

111140

Title :

Matrix Completion for Weakly-Supervised Multi-Label Image Classification

Author :

Cabral, Ricardo ; De la Torre, Fernando ; Costeira, Joao P. ; Bernardino, Alexandre

Author_Institution :

ECE Dept., Carnegie Mellon Univ., Pittsburgh, PA, USA

Volume :

Issue :

fYear :

2015

fDate :

Jan. 1 2015

Firstpage :

121

Lastpage :

135

Abstract :

In the last few years, image classification has become an incredibly active research topic, with widespread applications. Most methods for visual recognition are fully supervised, as they make use of bounding boxes or pixelwise segmentations to locate objects of interest. However, this type of manual labeling is time consuming, error prone and it has been shown that manual segmentations are not necessarily the optimal spatial enclosure for object classifiers. This paper proposes a weakly-supervised system for multi-label image classification. In this setting, training images are annotated with a set of keywords describing their contents, but the visual concepts are not explicitly segmented in the images. We formulate the weakly-supervised image classification as a low-rank matrix completion problem. Compared to previous work, our proposed framework has three advantages: (1) Unlike existing solutions based on multiple-instance learning methods, our model is convex. We propose two alternative algorithms for matrix completion specifically tailored to visual data, and prove their convergence. (2) Unlike existing discriminative methods, our algorithm is robust to labeling errors, background noise and partial occlusions. (3) Our method can potentially be used for semantic segmentation. Experimental validation on several data sets shows that our method outperforms state-of-the-art classification algorithms, while effectively capturing each class appearance.

Keywords :

image classification; image segmentation; learning (artificial intelligence); matrix algebra; background noise; bounding boxes; discriminative methods; labeling errors; low-rank matrix completion problem; manual labeling; manual segmentations; matrix completion; multiple-instance learning methods; object classifiers; optimal spatial enclosure; partial occlusions; pixelwise segmentations; semantic segmentation; state-of-the-art classification algorithms; training images; visual concepts; visual recognition; weakly-supervised multilabel image classification; Histograms; Image segmentation; Minimization; Pattern analysis; Semantics; Training; Vectors; Weakly-supervised learning; multi-label image classification; nuclear norm; rank minimization; segmentation;

fLanguage :

English

Journal_Title :

Pattern Analysis and Machine Intelligence, IEEE Transactions on

Publisher :

ieee

ISSN :

0162-8828

Type :

jour

DOI :

10.1109/TPAMI.2014.2343234

Filename :

6866218

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=111140