مرکز منطقه ای اطلاع رساني علوم و فناوري - Heterogeneous Visual Features Fusion via Sparse Multimodal Machine

DocumentCode :

3335028

Title :

Heterogeneous Visual Features Fusion via Sparse Multimodal Machine

Author :

Hua Wang ; Feiping Nie ; Heng Huang ; Ding, Chibiao

Author_Institution :

Dept. of Electr. Eng. & Comput. Sci., Colorado Sch. of Mines, Golden, CO, USA

fYear :

2013

fDate :

23-28 June 2013

Firstpage :

3097

Lastpage :

3102

Abstract :

To better understand, search, and classify image and video information, many visual feature descriptors have been proposed to describe elementary visual characteristics, such as the shape, the color, the texture, etc. How to integrate these heterogeneous visual features and identify the important ones from them for specific vision tasks has become an increasingly critical problem. In this paper, We propose a novel Sparse Multimodal Learning (SMML) approach to integrate such heterogeneous features by using the joint structured sparsity regularizations to learn the feature importance of for the vision tasks from both group-wise and individual point of views. A new optimization algorithm is also introduced to solve the non-smooth objective with rigorously proved global convergence. We applied our SMML method to five broadly used object categorization and scene understanding image data sets for both single-label and multi-label image classification tasks. For each data set we integrate six different types of popularly used image features. Compared to existing scene and object categorization methods using either single modality or multi-modalities of features, our approach always achieves better performances measured.

Keywords :

feature extraction; image classification; learning (artificial intelligence); optimisation; SMML method; elementary visual characteristics; feature descriptors; global convergence; heterogeneous visual features fusion; joint structured sparsity regularizations; multi label image classification tasks; nonsmooth objective; novel sparse multimodal learning approach; object categorization; optimization algorithm; scene understanding; single-label image classification tasks; sparse multimodal machine; Computer vision; Feature extraction; Image color analysis; Joints; Kernel; Support vector machines; Visualization; Data Integration; Structured Sparsity; Visual Features Fusion;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on

Conference_Location :

Portland, OR

ISSN :

1063-6919

Type :

conf

DOI :

10.1109/CVPR.2013.398

Filename :

6619242

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3335028