DocumentCode :
3239650
Title :
Multi-modal audio-visual event recognition for football analysis
Author :
Barnard, Mark ; Odobez, Jean-Marc ; Bengio, Samy
Author_Institution :
Dalle Molle Inst. for Perceptual Artificial Intelligence, Martigny, Switzerland
fYear :
2003
fDate :
17-19 Sept. 2003
Firstpage :
469
Lastpage :
478
Abstract :
The recognition of events within multi-modal data is a challenging problem. In this paper we focus on the recognition of events by using both audio and video data. We investigate the use of data fusion techniques in order to recognise these sequences within the framework of hidden Markov models (HMM) used to model audio and video data sequences. Specifically we look at the recognition of play and break sequences in football and the segmentation of football games based on these two events. Recognising relatively simple semantic events such as this is an important step towards full automatic indexing of such video material. These experiments were done using approximately 3 hours of data from two games of the Euro96 competition. We propose that modelling the audio and video streams separately for each sequence and fusing the decisions from each stream should yield an accurate and robust method of segmenting multi-modal data.
Keywords :
hidden Markov models; image segmentation; image sequences; pattern recognition; sensor fusion; video signal processing; audio data sequences; data fusion techniques; hidden Markov models; multimodal audio-visual event recognition; play and break sequences; video data sequences; Artificial intelligence; Games; Hidden Markov models; Machine assisted indexing; Multimedia communication; Project management; Robustness; Streaming media; TV; Tiles;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks for Signal Processing, 2003. NNSP'03. 2003 IEEE 13th Workshop on
ISSN :
1089-3555
Print_ISBN :
0-7803-8177-7
Type :
conf
DOI :
10.1109/NNSP.2003.1318046
Filename :
1318046
Link To Document :
بازگشت