• DocumentCode
    19155
  • Title
    Bayesian Nonparametrics for Microphone Array Processing
  • Author
    Otsuka, Takayuki ; Ishiguro, Katsuhiko ; Sawada, Hideyuki ; Okuno, Hiroshi G.
  • Author_Institution
    Grad. Sch. of Inf., Kyoto Univ., Kyoto, Japan
  • Volume
    22
  • Issue
    2
  • fYear
    2014
  • fDate
    Feb. 2014
  • Firstpage
    493
  • Lastpage
    504
  • Abstract
    Sound source localization and separation from a mixture of sounds are essential functions for computational auditory scene analysis. The main challenges are designing a unified framework for joint optimization and estimating the sound sources under auditory uncertainties such as reverberation or an unknown number of sounds. Because sound source localization and separation are mutually dependent, they must be estimated simultaneously for better and more robust performance. A unified model for sound source localization and separation based on Bayesian nonparametrics is presented. Experiments using simulated and recorded audio mixtures show that a method based on this model achieves state-of-the-art sound source separation quality and estimates the number of sources more robustly in reverberant environments.
  • Keywords
    array signal processing; audio signal processing; microphone arrays; source separation; Bayesian nonparametrics; audio mixture recording; auditory uncertainties; computational auditory scene analysis; joint optimization; microphone array processing; reverberant environment; sound mixture; sound source estimation; sound source localization; sound source separation; source number estimation; source separation quality; unified model; Arrays; Bayes methods; Estimation; Microphones; Source separation; Time-frequency analysis; Vectors; Audio source separation and enhancement (AUD-SSEN); Bayesian nonparametrics; blind source separation; microphone array processing; sound source localization; spatial and multichannel audio (AUD-SMCA); time-frequency masking;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE/ACM Transactions on
  • Publisher
    IEEE
  • ISSN
    2329-9290
  • Type
    jour
  • DOI
    10.1109/TASLP.2013.2294582
  • Filename
    6680684