DocumentCode :
2961287
Title :
Audiovisual event detection towards scene understanding
Author :
Canton-Ferrer, C. ; Butko, Taras ; Segura, Carlos ; Giro, X. ; Nadeu, Climent ; Hernando, Juan ; Casas, J.R.
Author_Institution :
Tech. Univ. of Catalonia, Barcelona, Spain
fYear :
2009
fDate :
20-25 June 2009
Firstpage :
81
Lastpage :
88
Abstract :
Acoustic events produced in meeting environments may contain useful information for perceptually aware interfaces and multimodal behavior analysis. In this paper, a system to detect and recognize these events from a multimodal perspective is presented combining information from multiple cameras and microphones. First, spectral and temporal features are extracted from a single audio channel and spatial localization is achieved by exploiting cross-correlation among microphone arrays. Second, several video cues obtained from multiperson tracking, motion analysis, face recognition, and object detection provide the visual counterpart of the acoustic events to be detected. A multimodal data fusion at score level is carried out using two approaches: weighted mean average and fuzzy integral. Finally, a multimodal database containing a rich variety of acoustic events has been recorded including manual annotations of the data. A set of metrics allow assessing the performance of the presented algorithms. This dataset is made publicly available for research purposes.
Keywords :
audio signal processing; face recognition; motion estimation; object detection; sensor fusion; transforms; video signal processing; acoustic events; audiovisual event detection; face recognition; motion analysis; multi-person tracking; multimodal database; object detection; spectral features; temporal features; weighted mean average; Acoustic signal detection; Cameras; Data mining; Event detection; Feature extraction; Information analysis; Layout; Microphone arrays; Object detection; Tracking;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition Workshops, 2009. CVPR Workshops 2009. IEEE Computer Society Conference on
Conference_Location :
Miami, FL
ISSN :
2160-7508
Print_ISBN :
978-1-4244-3994-2
Type :
conf
DOI :
10.1109/CVPRW.2009.5204264
Filename :
5204264
Link To Document :
بازگشت