• DocumentCode
    2811650
  • Title

    Integrating monaural and binaural analysis for localizing multiple reverberant sound sources

  • Author

    Woodruff, John ; Wang, DeLiang

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    2706
  • Lastpage
    2709
  • Abstract
    Localization of simultaneous sound sources in natural environments with only two microphones is a challenging problem. Reverberation degrades performance of localization based exclusively on directional cues. We present an approach that integrates monaural and binaural analysis to improve localization of multiple speech sources in noisy and reverberant environments. Our approach incorporates pitch-based monaural processing to perform simultaneous organization of voiced speech. We propose a probabilistic framework to jointly perform localization and sequential organization using binaural cues. We evaluate our system on multi-source speech mixtures in the presence of reverberation and diffuse noise and compare it to two localization approaches that do not incorporate monaural cues. Results indicate that our system can accurately localize multiple sources in very challenging conditions.
  • Keywords
    microphones; probability; speech processing; binaural analysis; computational auditory scene analysis; microphone; monaural analysis; multiple reverberant sound source; multiple speech source; pitch-based monaural processing; probabilistic framework; Acoustic noise; Azimuth; Computer science; Degradation; Filters; Frequency estimation; Microphones; Reverberation; Speech analysis; Working environment noise; Binaural sound localization; computational auditory scene analysis; monaural grouping; sequential organization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5496242
  • Filename
    5496242