DocumentCode :
3707817
Title :
A compact shot representation for video semantic indexing
Author :
Jinzhuo Wang;Wenmin Wang;Ronggang Wang;Wen Gao
Author_Institution :
School of Electronic and Computer Engineering, Shenzhen Graduate School, Peking University
fYear :
2015
Firstpage :
3265
Lastpage :
3269
Abstract :
This paper presents a compact shot representation for video semantic indexing (SIN). The proposed representation consists of visual cues from only two frames, i.e., key frame (KF) and difference frame (DF), which are both constructed with spatial pyramid. The KF describes static information while the generated DF captures non-static information. Each region of DF is derived from the same location in a selected frame, which has the most salient difference compared with the key frame in that region. We introduce a variation of DF to further enhance our model. Experimental results on TRECVID SIN demonstrate that our method obtains better accuracy than the state-of-the-art, while requiring less storage space and consuming time.
Keywords :
"Semantics","Visualization","Indexing","Feature extraction","Yttrium","Video signal processing","Histograms"
Publisher :
ieee
Conference_Titel :
Image Processing (ICIP), 2015 IEEE International Conference on
Type :
conf
DOI :
10.1109/ICIP.2015.7351407
Filename :
7351407
Link To Document :
بازگشت