مرکز منطقه ای اطلاع رساني علوم و فناوري - A compact shot representation for video semantic indexing

DocumentCode :

3707817

Title :

A compact shot representation for video semantic indexing

Author :

Jinzhuo Wang;Wenmin Wang;Ronggang Wang;Wen Gao

Author_Institution :

School of Electronic and Computer Engineering, Shenzhen Graduate School, Peking University

fYear :

2015

Firstpage :

3265

Lastpage :

3269

Abstract :

This paper presents a compact shot representation for video semantic indexing (SIN). The proposed representation consists of visual cues from only two frames, i.e., key frame (KF) and difference frame (DF), which are both constructed with spatial pyramid. The KF describes static information while the generated DF captures non-static information. Each region of DF is derived from the same location in a selected frame, which has the most salient difference compared with the key frame in that region. We introduce a variation of DF to further enhance our model. Experimental results on TRECVID SIN demonstrate that our method obtains better accuracy than the state-of-the-art, while requiring less storage space and consuming time.

Keywords :

"Semantics","Visualization","Indexing","Feature extraction","Yttrium","Video signal processing","Histograms"

Publisher :

ieee

Conference_Titel :

Image Processing (ICIP), 2015 IEEE International Conference on

Type :

conf

DOI :

10.1109/ICIP.2015.7351407

Filename :

7351407

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3707817