DocumentCode :
493778
Title :
Two microphone based direction of arrival estimation for multiple speech sources using spectral properties of speech
Author :
Zhang, Wenyi ; Rao, Bhaskar D.
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of California, San Diego, CA
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
2193
Lastpage :
2196
Abstract :
A two microphone direction of arrival (DOA) estimation technique for multiple speech sources is developed which exploits speech specific properties, namely sparsity in time-frequency (spectrum) domain. For robustness, we exploit the sparsity in the frequency domain by focusing on the spectral content concentrated in sinusoidal tracks obtained through sinusoidal modeling. When multiple speeches are mixed in the two microphone system, the inter-channel phase differences (IPD) between the dual channels on those sinusoidal tracks will be dominated by the spatial information of the most powerful source at that specific time-frequency point because of the spectrum sparsity and masking effects. Thereby, the source localization problem is turned into a clustering problem on the IPD versus frequency plot, and the generalized mixture decomposition algorithm (GMDA) is used to cluster the groups of points corresponding to multiple sources. The DOA of each source is derived from the parameters of each cluster. Experimental results conducted show the scheme to be very effective.
Keywords :
direction-of-arrival estimation; spectral analysis; speech processing; time-frequency analysis; generalized mixture decomposition algorithm; inter-channel phase differences; masking effects; microphone-based direction-of-arrival estimation; multiple speech sources; sinusoidal modeling; source localization problem; spectral speech properties; spectrum domain; spectrum sparsity; time-frequency domain; Clustering algorithms; Direction of arrival estimation; Frequency domain analysis; Microphones; Power system modeling; Robustness; Speech analysis; Speech coding; Streaming media; Time frequency analysis; Two microphone system; direction of arrival estimation; generalized mixture decomposition algorithm; sinusoidal modeling; sparsity; speech;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4960053
Filename :
4960053
Link To Document :
بازگشت