DocumentCode :
2819367
Title :
Voice source localization for automatic camera pointing system in videoconferencing
Author :
Wang, Hong ; Chu, Peter
Author_Institution :
PictureTel Corp., Andover, MA, USA
fYear :
1997
fDate :
19-22 Oct 1997
Abstract :
This paper describes the voice source localization algorithm used in the PictureTel automatic camera pointing system (LimeLight™, dynamic speech locating technology). The system uses a microphone array 46 cm wide and 30 cm high, which contains 4 microphones and is mounted on top of the monitor. The three-dimensional position of a sound source is calculated from the time delays of 4 pairs of microphones. In time delay estimation, the averaging of signal onsets in each frequency band is combined with phase correlation to reduce the influence of noise and reverberation. With this approach, a small microphone array can provide reliable three-dimensional voice source localization. Post-processing based on a priori knowledge is also introduced to eliminate the influence of reflections from furniture such as tables. Results of speech source localization under real conference room conditions are given. Some system-related issues are also discussed.
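The abstract names phase correlation as one ingredient of the time delay estimation. As a rough illustration only, the sketch below estimates the delay between two microphone signals with plain phase correlation (often called GCC-PHAT). It is not the paper's method: the onset averaging per frequency band and the post-processing are not reproduced, and the function names, the frame length of 64 samples, and the circularly shifted test signal are all assumptions made for the example.

```python
import cmath
import math
import random


def dft(x, inverse=False):
    """Naive O(n^2) discrete Fourier transform; adequate for short frames."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = 0j
        for t in range(n):
            acc += x[t] * cmath.exp(sign * 2j * math.pi * k * t / n)
        out.append(acc / n if inverse else acc)
    return out


def phase_correlation_delay(ref, sig, eps=1e-12):
    """Estimate by how many samples `sig` lags `ref` (negative if it leads).

    Hypothetical helper: assumes equal-length frames and a circular shift.
    """
    n = len(ref)
    ref_f = dft(ref)
    sig_f = dft(sig)
    # Cross-spectrum of the two frames.
    cross = [s * r.conjugate() for s, r in zip(sig_f, ref_f)]
    # Discard magnitude so that only the phase (the delay) remains.
    whitened = [c / max(abs(c), eps) for c in cross]
    corr = [v.real for v in dft(whitened, inverse=True)]
    peak = max(range(n), key=lambda i: corr[i])
    # Indices past the midpoint correspond to negative circular lags.
    return peak if peak <= n // 2 else peak - n


# Illustrative input: a noise frame and a copy delayed by 5 samples.
random.seed(0)
frame = [random.uniform(-1.0, 1.0) for _ in range(64)]
delayed = [frame[(i - 5) % 64] for i in range(64)]
print(phase_correlation_delay(frame, delayed))
```

In a real array the per-pair delays in samples would be divided by the sampling rate and multiplied by the speed of sound to obtain path-length differences, from which a position can be solved. Those steps depend on geometry and parameters that the abstract does not give, so they are left out here.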
Keywords :
acoustic signal detection; acoustic signal processing; acoustic wave reflection; array signal processing; correlation methods; delays; direction-of-arrival estimation; microphones; noise; reverberation; speech processing; teleconferencing; video cameras; 3D position; 3D voice source localization; LimeLight; PictureTel automatic camera pointing system; conference room conditions; dynamic speech locating technology; frequency band; furniture; microphone array; noise; phase correlation; post processing; reflections; reverberation; signal onsets averaging; sound source; speech source localization; tables; time delay estimation; videoconferencing; voice source localization algorithm; Acoustic noise; Cameras; Delay effects; Delay estimation; Frequency estimation; Microphone arrays; Monitoring; Noise reduction; Phase estimation; Speech;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Applications of Signal Processing to Audio and Acoustics, 1997. 1997 IEEE ASSP Workshop on
Conference_Location :
New Paltz, NY
Print_ISBN :
0-7803-3908-8
Type :
conf
DOI :
10.1109/ASPAA.1997.625639
Filename :
625639