Title :
Binaural speech segregation based on pitch and azimuth tracking
Author :
Woodruff, John ; Wang, DeLiang
Author_Institution :
Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
Abstract :
We propose an approach to binaural speech segregation in reverberation based on pitch and azimuth cues. These cues are integrated within a statistical tracking framework to estimate up to two concurrent pitch frequencies and three concurrent azimuth angles. The tracking framework implicitly estimates binary time-frequency masks by solving a data association problem, thereby performing speech segregation. Experimental results show that the proposed approach compares favorably to existing two-microphone systems in spite of less prior information. The benefit of the proposed approach is most pronounced in conditions with substantial reverberation or for closely spaced sources.
Keywords :
frequency estimation; microphones; reverberation; sensor fusion; speech processing; statistical analysis; time-frequency analysis; azimuth cues; azimuth tracking; binary time-frequency mask estimation; binaural speech segregation; concurrent azimuth angles; data association problem; pitch cues; pitch tracking; reverberation; statistical tracking framework; two-microphone systems; Azimuth; Hidden Markov models; Mathematical model; Microphones; Signal to noise ratio; Speech; Time frequency analysis; Computational auditory scene analysis; binaural localization; multipitch tracking; speech segregation;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6287862