Title :
Role of head pose estimation in speech acquisition from distant microphones
Author :
Shivappa, Shankar T. ; Rao, Bhaskar D. ; Trivedi, Mohan M.
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of California, La Jolla, CA
Abstract :
Reverberant environments pose a challenge to speech acquisition from distant microphones. Approaches using microphone arrays have met with limited success. Recent research using audio-visual sensors for tasks such as speaker localization has shown improvement over traditional audio-only approaches. Using computer vision techniques, we can estimate the orientation of the speaker's head in addition to the location of the speaker. In this paper, we study the utility of head pose information for effective beamforming and clean speech acquisition from distant microphones. Improvements in speech recognition accuracy relative to that of a close-talking microphone are presented, and the results provide sufficient motivation for incorporating head pose information in beamforming techniques.
Keywords :
array signal processing; audio-visual systems; computer vision; microphones; pose estimation; speech enhancement; speech recognition; audio-visual sensor; beamforming; head pose estimation; microphone; reverberant environment; speaker localization; speech acquisition; Delay; Drives; Loudspeakers; Microphone arrays; Transfer functions; Vocabulary; audio-visual fusion; human-computer interface; intelligent spaces
Conference_Title :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-2353-8
ISSN :
1520-6149
DOI :
10.1109/ICASSP.2009.4960394