Title :
Smart room: participant and speaker localization and identification
Author :
Busso, Carlos ; Hernanz, Sergi ; Chu, Chi-Wei ; Kwon, Soon-Il ; Lee, Sung ; Georgiou, Panayiotis G. ; Cohen, Isaac ; Narayanan, Shrikanth
Author_Institution :
Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA, USA
Abstract :
Our long-term objective is to create smart room technologies that are aware of the users presence and their behavior and can become an active, but not an intrusive, part of the interaction. In this work, we present a multimodal approach for estimating and tracking the location and identity of the participants including the active speaker. Our smart room design contains three user-monitoring systems: four CCD cameras, an omnidirectional camera and a 16 channel microphone array. The various sensory modalities are processed both individually and jointly and it is shown that the multimodal approach results in significantly improved performance in spatial localization, identification and speech activity detection of the participants.
Keywords :
array signal processing; audio signal processing; face recognition; multimedia systems; speaker recognition; video signal processing; CCD cameras; active speaker localization; face detection; microphone array; multimodal method; omnidirectional camera; participant identification; participant localization; smart room technology; speaker identification; speech activity detection; Acoustic signal detection; Charge coupled devices; Charge-coupled image sensors; Filtering; Firewire; Humans; Indexing; Loudspeakers; Microphone arrays; Smart cameras;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on
Print_ISBN :
0-7803-8874-7
DOI :
10.1109/ICASSP.2005.1415605