Title :
Combination of nested microphone array and subband processing for multiple simultaneous speaker localization
Author :
Firoozabadi, Ali Dehghan ; Abutalebi, H.R.
Author_Institution :
Electr. & Comput. Eng. Dept., Yazd Univ., Yazd, Iran
Abstract :
Speaker localization is one of the active topics in speech processing field. In this paper, we use a two-step method based on Time Difference Of Arrival (TDOA) for the localization of multiple simultaneous speech sources. In this method, directions of speakers are estimated by computing Generalized Cross Correlation (GCC) between microphone signals. In this paper, we propose a method based on combination of subband processing and nested microphone arrays. The use of subband processing is effective in increasing accuracy of multiple speaker localization. Also, the nested array can remove spatial aliasing by intelligent selection of some microphone subsets and assigning them to different subbands. When microphones of each subband were determined, subband processing is just applied on the data from that microphone subset. Moreover, targeting the high-noise environmental conditions, we use the GCC-Maximum Likelihood (GCC-ML) as the localization core of the proposed method. The combination of these all leads to omitting spatial aliasing and increasing the localization accuracy. Simulation results on different environmental scenarios validate the superior performance of the proposed method in the localization of multiple simultaneous speakers.
Keywords :
correlation methods; direction-of-arrival estimation; maximum likelihood estimation; microphone arrays; speaker recognition; GCC-ML; GCC-maximum likelihood; TDOA; generalized cross correlation; microphone signal; nested microphone array; simultaneous speaker localization; simultaneous speech source localization; spatial aliasing; speaker direction estimation; speech processing; subband processing; time difference of arrival; two-step method; Arrays; Finite impulse response filters; Histograms; Microphones; Noise measurement; Reverberation; Speech; Direction Of Arrival (DOA); Nested Microphone Array; Speech Source Localization; Subband Processing; Time Difference Of Arrival (TDOA);
Conference_Titel :
Telecommunications (IST), 2012 Sixth International Symposium on
Conference_Location :
Tehran
Print_ISBN :
978-1-4673-2072-6
DOI :
10.1109/ISTEL.2012.6483115