DocumentCode :
2854804
Title :
Talker localization in a real acoustic environment based on DOA estimation and statistical sound source identification
Author :
Nishiura, Takanobu ; Nakamura, Satoshi ; Shikano, Kiyohiro
Author_Institution :
Faculty of Systems Engineering Wakayama University, 930 Sakaedani, 640-8510 Japan
Volume :
1
fYear :
2002
fDate :
13-17 May 2002
Abstract :
For a hands-free speech interface, it is very important to capture distant talking speech with high quality. A microphone array is an ideal candidate for this purpose. However, this approach requires localizing the target talker. Conventional talker localization algorithms in multiple sound source environments not only have difficulty localizing the multiple sound sources accurately, but also have difficulty localizing the target talker among known multiple sound source positions. To cope with these problems, we propose a new talker localization algorithm consisting of two algorithms. One is DOA (Direction Of Arrival) estimation algorithm for multiple sound source localization based on CSP (Cross-power Spectrum Phase) coefficient addition method. The other is statistical sound source identification algorithm based on GMM (Gaussian Mixture Model) for localizing the target talker position among localized multiple sound sources. In this paper, we particularly focus on the talker localization performance based on the combination of these two algorithms with a microphone array.
Keywords :
Arrays; Direction of arrival estimation; Estimation; Mel frequency cepstral coefficient; Noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.2002.5743883
Filename :
5743883
Link To Document :
بازگشت