Title :
GMM-based binaural localization of sound sources in both simulated and real rooms
Author :
Zhang Chun-lei ; Zeng Xiang-yang ; Zhang Gui-min
Author_Institution :
Coll. of Marine, Northwestern Polytech. Univ., Xi´an, China
Abstract :
This paper describes a system that is able to localize and detect sound sources in the presence of interfering noise and reverberation from the stereo sound recording. First, a computational auditory scene analysis (CASA) based binaural front-end is applied to generate the binaural cues, interaural time and level differences (ITDs & ILDs). Second, based on the probabilistic nature of the binaural cues, a combination of ITDs and ILDs as the binaural feature space is modeled by Gaussian mixture models (GMMs) to compute the probability density functions (PDFs) of time-frequency units. Speech source localization was determined by a Bayesian maximum a posterior (MAP). Third, binary mask is estimated after Bayesian analysis to detect the speech. For evaluating the performance of this proposed system, both simulated acoustic condition and real rooms are applied in the evaluation stage. The results show that our proposed method achieves a good speech localization performance.
Keywords :
Bayes methods; Gaussian processes; Hi-Fi equipment; maximum likelihood estimation; speech processing; Bayesian maximum a posterior; GMM; Gaussian mixture model; MAP; binaural cue; binaural feature space; binaural localization; computational auditory scene analysis; interaural time; interfering noise; level difference; probability density functions; sound source; speech source localization; stereo sound recording; time-frequency unit; Accuracy; Azimuth; Computational modeling; Noise; Receivers; Reverberation; Speech; Gaussian Mixture Models (GMMs); ILD; ITD; Localization; computational auditory scene analysis (CASA);
Conference_Titel :
Signal Processing, Communication and Computing (ICSPCC), 2013 IEEE International Conference on
Conference_Location :
KunMing
DOI :
10.1109/ICSPCC.2013.6664023