مرکز منطقه ای اطلاع رساني علوم و فناوري - Learning Shared, Discriminative, and Compact Representations for Visual Recognition

DocumentCode :

3601482

Title :

Learning Shared, Discriminative, and Compact Representations for Visual Recognition

Author :

Lobel, Hans ; Vidal, Rene ; Soto, Alvaro

Author_Institution :

Dept. of Comput. Sci., Pontificia Univ. Catolica de Chile, Santiago, Chile

Volume :

Issue :

fYear :

2015

Firstpage :

2218

Lastpage :

2231

Abstract :

Dictionary-based and part-based methods are among the most popular approaches to visual recognition. In both methods, a mid-level representation is built on top of low-level image descriptors and high-level classifiers are trained on top of the mid-level representation. While earlier methods built the mid-level representation without supervision, there is currently great interest in learning both representations jointly to make the mid-level representation more discriminative. In this work we propose a new approach to visual recognition that jointly learns a shared, discriminative, and compact mid-level representation and a compact high-level representation. By using a structured output learning framework, our approach directly handles the multiclass case at both levels of abstraction. Moreover, by using a group-sparse prior in the structured output learning framework, our approach encourages sharing of visual words and thus reduces the number of words used to represent each class. We test our proposed method on several popular benchmarks. Our results show that, by jointly learning midand high-level representations, and fostering the sharing of discriminative visual words among target classes, we are able to achieve state-of-the-art recognition performance using far less visual words than previous approaches.

Keywords :

image classification; image recognition; image representation; learning (artificial intelligence); compact representation; dictionary-based method; discriminative representation; discriminative visual words; group-sparse prior; high-level classifier; low-level image descriptors; midlevel representation; part-based method; shared representation; structured output learning framework; visual recognition; Complexity theory; Dictionaries; Encoding; Feature extraction; Joints; Optimization; Visualization; Dictionary Learning; Group Sparsity; Image Categorization; Image categorization; Max-margin Learning; Structural SVMs; dictionary learning; group sparsity; max-margin learning; structural SVMs;

fLanguage :

English

Journal_Title :

Pattern Analysis and Machine Intelligence, IEEE Transactions on

Publisher :

ieee

ISSN :

0162-8828

Type :

jour

DOI :

10.1109/TPAMI.2015.2408349

Filename :

7053941

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3601482