مرکز منطقه ای اطلاع رساني علوم و فناوري - Unsupervised Semantic Parsing of Video Collections

DocumentCode :

3748938

Title :

Unsupervised Semantic Parsing of Video Collections

Author :

Ozan Sener;Amir R. Zamir;Silvio Savarese;Ashutosh Saxena

fYear :

2015

Firstpage :

4480

Lastpage :

4488

Abstract :

Human communication typically has an underlying structure. This is reflected in the fact that in many user generated videos, a starting point, ending, and certain objective steps between these two can be identified. In this paper, we propose a method for parsing a video into such semantic steps in an unsupervised way. The proposed method is capable of providing a semantic "storyline" of the video composed of its objective steps. We accomplish this utilizing both visual and language cues in a joint generative model. The proposed method can also provide a textual description for each of identified semantic steps and video segments. We evaluate this method on a large number of complex YouTube videos and show results of unprecedented quality for this new and impactful problem.

Keywords :

"Visualization","Proposals","Semantics","YouTube","Atomic measurements","Conferences","Computer vision"

Publisher :

ieee

Conference_Titel :

Computer Vision (ICCV), 2015 IEEE International Conference on

Electronic_ISBN :

2380-7504

Type :

conf

DOI :

10.1109/ICCV.2015.509

Filename :

7410866

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3748938