DocumentCode :
2960757
Title :
Using closed captions to train activity recognizers that improve video retrieval
Author :
Gupta, Swastik ; Mooney, Raymond J
Author_Institution :
Dept. of Comput. Sci., Univ. of Texas at Austin, Austin, TX, USA
fYear :
2009
fDate :
20-25 June 2009
Firstpage :
30
Lastpage :
37
Abstract :
Recognizing activities in real-world videos is a difficult problem exacerbated by background clutter, changes in camera angle & zoom, rapid camera movements etc. Large corpora of labeled videos can be used to train automated activity recognition systems, but this requires expensive human labor and time. This paper explores how closed captions that naturally accompany many videos can act as weak supervision that allows automatically collecting `labeled´ data for activity recognition. We show that such an approach can improve activity retrieval in soccer videos. Our system requires no manual labeling of video clips and needs minimal human supervision. We also present a novel caption classifier that uses additional linguistic information to determine whether a specific comment refers to an on-going activity. We demonstrate that combining linguistic analysis and automatically trained activity recognizers can significantly improve the precision of video retrieval.
Keywords :
classification; content-based retrieval; video retrieval; activity recognizers; automated activity recognition; background clutter; caption classifier; closed captions; human supervision; soccer videos; video retrieval; Broadcasting; Cameras; Content based retrieval; DVD; Games; Humans; Labeling; Multimedia communication; Multimedia systems; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition Workshops, 2009. CVPR Workshops 2009. IEEE Computer Society Conference on
Conference_Location :
Miami, FL
ISSN :
2160-7508
Print_ISBN :
978-1-4244-3994-2
Type :
conf
DOI :
10.1109/CVPRW.2009.5204202
Filename :
5204202
Link To Document :
بازگشت