DocumentCode :
3017213
Title :
Localization of Multiple Speech Sources Based on Sub-band Steered Response Power
Author :
Cai, Weiping ; Zhao, Xiaoyan ; Wu, Zhenyang
Author_Institution :
Sch. of Inf. Sci. & Eng., Southeast Univ., Nanjing, China
fYear :
2010
fDate :
25-27 June 2010
Firstpage :
1246
Lastpage :
1249
Abstract :
The steered response power with phase transform weighted (SRP-PHAT) is a robust sound source localization method based on microphone array. Multiple source localization has been implemented using SRP-PHAT with agglomerative clustering (AC). In this paper, a novel method of multiple speech source localization based on sub-band SRP is proposed. In this method, speech signal is divided into several sub-bands, where sub-band SRP is computed, initial estimations are generated by searching the maximum in every sub-band SRP, and the final source locations are determined from the initial estimations using AC. The proposed method is tested with the real-world recordings under the condition that the number of active speakers is unknown. The results show that our method provides higher localization performance than that of the conventional SRP-PHAT with AC in the cases with up to 3 concurrent speakers.
Keywords :
acoustic generators; acoustic signal processing; microphone arrays; speech processing; SRP-PHAT; acoustic signal processing; agglomerative clustering; microphone array; multiple speech sources; sound source localization; steered response power with phase transform weighted; Acoustics; Arrays; Clustering algorithms; Estimation; Microphones; Position measurement; Speech; acoustic source localization; clustering; microphone array;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electrical and Control Engineering (ICECE), 2010 International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-6880-5
Type :
conf
DOI :
10.1109/iCECE.2010.310
Filename :
5631789
Link To Document :
بازگشت