DocumentCode :
2855852
Title :
Talker localization based on the combination of DOA estimation and statistical sound source identification with microphone array
Author :
Nishiura, Takanobu ; Nakamura, Satoshi
Author_Institution :
ATR Spoken Language Translation Res. Laboratories, Kyoto, Japan
fYear :
2003
fDate :
28 Sept.-1 Oct. 2003
Firstpage :
597
Lastpage :
600
Abstract :
It is very important for a hands-free speech interface to capture distant-talking speech with high quality. A microphone array is an ideal candidate for this purpose, but it requires localizing the target talker. Conventional talker localization algorithms in multiple sound source environments have difficulty not only localizing the multiple sound sources accurately, but also identifying the target talker among known sound source positions. To cope with these problems, we propose a new talker localization algorithm consisting of two algorithms. One is a DOA (direction of arrival) estimation algorithm for multiple sound source localization based on the CSP (cross-power spectrum phase) coefficient addition method. The other is a statistical sound source identification algorithm based on a GMM (Gaussian mixture model) for localizing the target talker position among the localized sound sources. In this paper, we particularly focus on evaluating the talker localization algorithm that combines these two algorithms with a microphone array.
Keywords :
Gaussian processes; array signal processing; direction-of-arrival estimation; microphones; speech recognition; DOA estimation; Gaussian mixture model; conventional talker localization; cross-power spectrum phase; direction of arrival; distant talking speech; hands-free speech interface; microphone array; sound source identification; sound sources; target talker; Acoustical engineering; Automatic speech recognition; Direction of arrival estimation; Laboratories; Microphone arrays; Natural languages; Phase estimation; Signal processing; Speech enhancement; Systems engineering and theory;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Statistical Signal Processing, 2003 IEEE Workshop on
Print_ISBN :
0-7803-7997-7
Type :
conf
DOI :
10.1109/SSP.2003.1289547
Filename :
1289547
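Note :
The abstract names a CSP coefficient addition method for DOA estimation but does not spell it out. The following is a minimal sketch of the general idea only, not the authors' implementation: it computes CSP (phase-whitened cross-correlation) coefficients for each microphone pair, adds them across pairs, and takes the peak lag. The function names `dft`, `csp_coefficients`, and `estimate_lag` are hypothetical, and the sketch assumes all pairs share the same lag axis (for example, equal spacing), which the record does not state.

```python
import cmath
import math


def dft(x, inverse=False):
    """Naive O(n^2) discrete Fourier transform (illustrative, stdlib only)."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        s = sum(x[t] * cmath.exp(sign * 2j * math.pi * k * t / n)
                for t in range(n))
        out.append(s / n if inverse else s)
    return out


def csp_coefficients(a, b):
    """CSP coefficients of two equal-length frames a and b.

    The cross-power spectrum is normalized to unit magnitude so that only
    phase remains, then transformed back to the lag domain.
    """
    spec_a, spec_b = dft(a), dft(b)
    cross = [x.conjugate() * y for x, y in zip(spec_a, spec_b)]
    white = [c / abs(c) if abs(c) > 1e-12 else 0j for c in cross]
    return [v.real for v in dft(white, inverse=True)]


def estimate_lag(pairs):
    """Add CSP coefficients over microphone pairs and return the peak lag.

    Assumption: every pair observes the same lag axis. A positive result
    means the second signal of each pair is delayed relative to the first.
    """
    n = len(pairs[0][0])
    total = [0.0] * n
    for a, b in pairs:
        for i, v in enumerate(csp_coefficients(a, b)):
            total[i] += v
    peak = max(range(n), key=lambda i: total[i])
    # Lags above n/2 wrap around to negative delays.
    return peak if peak <= n // 2 else peak - n
```

Converting the estimated lag to an arrival angle would additionally need the microphone spacing, sampling rate, and speed of sound, none of which appear in this record.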