DocumentCode :
716703
Title :
Binaural sound source localization based on generalized parametric model and two-layer matching strategy in complex environments
Author :
Hong Liu ; Cheng Pang ; Jie Zhang
Author_Institution :
Fac. of Key Lab. of Machine Perception, Peking Univ., Shenzhen, China
fYear :
2015
fDate :
26-30 May 2015
Firstpage :
4496
Lastpage :
4503
Abstract :
Binaural sound source localization is an important technique involving Human-Robot Interaction (HRI), video conference, speech enhancement, etc. In many real application scenarios, especially for closed environments, the affect of reverberation and noise would degrade the precision of position estimations. Therefore, a new binaural sound source localization method based on generalized parametric model and two-layer matching strategy is proposed in this paper for complex environments. Firstly, cepstral prefiltering is utilized for dereverberation of binaural signals. Then, two binaural cues computed from a dual-channel frequency representation, are combined to estimate the azimuths of sources. Additionally, the generalized parametric model is presented to describe the relationship between the azimuth and binaural cues through finding the optimal scaling factors from training data. At last, a two-layer matching strategy based on Bayesian rule is used to make the final decision, which can effectively decrease the computation complexity. Experiments have validated the proposed approach and show that it achieves favorably better results compared with several available methods without extra spacial burden.
Keywords :
acoustic generators; acoustic radiators; human-robot interaction; Bayesian rule; HRI; binaural sound source localization; binaural sound source localization method; closed environments; complex environments; computation complexity; dual channel frequency representation; generalized parametric model; human robot interaction; position estimations; speech enhancement; two layer matching strategy; video conference; Azimuth; Cepstrum; Estimation; Frequency estimation; Parametric statistics; Reverberation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Robotics and Automation (ICRA), 2015 IEEE International Conference on
Conference_Location :
Seattle, WA
Type :
conf
DOI :
10.1109/ICRA.2015.7139822
Filename :
7139822
Link To Document :
بازگشت