DocumentCode
2860869
Title
Content-Based TV Sports Video Retrieval Based on Audio-Visual Features and Text Information
Author
Huayong, Liu
Author_Institution
Central China Normal University, China
fYear
2004
fDate
20-24 Sept. 2004
Firstpage
481
Lastpage
484
Abstract
In this paper, we propose an approach to content-based video retrieval, that is, retrieval of video by its semantic content. Because video data is composed of multimodal information streams such as visual, auditory and textual streams, we describe a strategy that uses multimodal analysis to parse sports video automatically. The paper first defines the basic structure of a sports video database system, and then introduces a new approach that integrates visual stream analysis, speech recognition, speech signal processing and text extraction to realize video retrieval. Experimental results on TV sports video of football games indicate that multimodal analysis is effective for video retrieval, either by quickly browsing tree-like video clips or by inputting keywords within a predefined domain.
Keywords
Content based retrieval; Database systems; Information analysis; Information retrieval; Signal analysis; Speech analysis; Speech processing; Speech recognition; Streaming media; TV;
fLanguage
English
Publisher
IEEE
Conference_Titel
Web Intelligence, 2004. WI 2004. Proceedings. IEEE/WIC/ACM International Conference on
Print_ISBN
0-7695-2100-2
Type
conf
DOI
10.1109/WI.2004.10107
Filename
1410849