مرکز منطقه ای اطلاع رساني علوم و فناوري - MMSS: Multi-modal Sharable and Specific Feature Learning for RGB-D Object Recognition

DocumentCode :

3748567

Title :

MMSS: Multi-modal Sharable and Specific Feature Learning for RGB-D Object Recognition

Author :

Anran Wang;Jianfei Cai;Jiwen Lu;Tat-Jen Cham

Author_Institution :

Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore

fYear :

2015

Firstpage :

1125

Lastpage :

1133

Abstract :

Most of the feature-learning methods for RGB-D object recognition either learn features from color and depth modalities separately, or simply treat RGB-D as undifferentiated four-channel data, which cannot adequately exploit the relationship between different modalities. Motivated by the intuition that different modalities should contain not only some modal-specific patterns but also some shared common patterns, we propose a multi-modal feature learning framework for RGB-D object recognition. We first construct deep CNN layers for color and depth separately, and then connect them with our carefully designed multi-modal layers, which fuse color and depth information by enforcing a common part to be shared by features of different modalities. In this way, we obtain features reflecting shared properties as well as modal-specific properties in different modalities. The information of the multi-modal learning frameworks is back-propagated to the early CNN layers. Experimental results show that our proposed multi-modal feature learning method outperforms state-of-the-art approaches on two widely used RGB-D object benchmark datasets.

Keywords :

"Image color analysis","Object recognition","Feature extraction","Labeling","Sparse matrices","Computer vision","Learning systems"

Publisher :

ieee

Conference_Titel :

Computer Vision (ICCV), 2015 IEEE International Conference on

Electronic_ISBN :

2380-7504

Type :

conf

DOI :

10.1109/ICCV.2015.134

Filename :

7410491

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3748567