مرکز منطقه ای اطلاع رساني علوم و فناوري - Discriminative Feature Learning for Video Semantic Segmentation

DocumentCode :

3661733

Title :

Discriminative Feature Learning for Video Semantic Segmentation

Author :

Han Zhang;Kai Jiang;Yu Zhang;Qing Li;Changqun Xia;Xiaowu Chen

Author_Institution :

State Key Lab. of Virtual Reality Technol. &

fYear :

2014

Firstpage :

321

Lastpage :

326

Abstract :

In this paper, we propose a novel deep learning based method for video semantic segmentation. Specially, we utilize 3D convolution neural network (3D CNN) to learn discriminative hierarchical features from spatial-temporal volumes for accurate pixel labelling. The learned features are capable of capturing both appearance and motion information. To align the pixel labels along real object boundaries, as well as maintain local consistency, we further perform graph-cut on a graph constructed on coherent 3d regions, or super-voxels, extracted from input video. Experiments demonstrate that due to the discriminative capability of learned features, our approach can obtain competitive labelling accuracy compared to the state-of-art in absence of sophisticated inference models, even with few training samples.

Keywords :

"Three-dimensional displays","Feature extraction","Semantics","Labeling","Accuracy","Computer vision","Image segmentation"

Publisher :

ieee

Conference_Titel :

Virtual Reality and Visualization (ICVRV), 2014 International Conference on

Type :

conf

DOI :

10.1109/ICVRV.2014.65

Filename :

7281085

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3661733