• DocumentCode
    3605850
  • Title

    A Time–Frequency Masking Based Random Finite Set Particle Filtering Method for Multiple Acoustic Source Detection and Tracking

  • Author

    Xionghu Zhong ; Hopgood, James R.

  • Author_Institution
    Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
  • Volume
    23
  • Issue
    12
  • fYear
    2015
  • Firstpage
    2356
  • Lastpage
    2370
  • Abstract
    Considering that multiple talkers may appear simultaneously, a time-frequency (TF) masking based random finite set (RFS) particle filtering (PF) method is developed for multiple acoustic source detection and tracking. The time-delay of arrival (TDOA) measurements of multiple sources are extracted by using a time-frequency masking technique, by which each source´s TF bins are clustered and separated in a joint gain-ratio and time-delay histogram. Since a joint detection and tracking problem is considered, both source positions and source numbers are time-varying and need to be estimated. The tracker is built within a RFS Bayesian filtering framework. Essentially, an RFS process is used to characterize the source dynamics that include source appearance/dissappearance and motion trajectories. Latent variables are also introduced to indicate source dynamics and measurement-source associations. Subsequently, a Rao-Blackwellization PF technique is employed so that the source position state can be marginalized and only the latent variables are estimated by using the PF. The main advantage of the proposed approach is that hypothesis-pruning is formulated in a full probabilistic sense. The performance of the proposed approach is demonstrated in real speech recordings as well as in simulated room environments.
  • Keywords
    acoustic signal detection; particle filtering (numerical methods); set theory; source separation; speech intelligibility; time-frequency analysis; time-of-arrival estimation; RFS Bayesian filtering framework; Rao-Blackwellization PF technique; TDOA measurement; TF masking based RFS PF method; joint gain-ratio and time-delay histogram; motion trajectory; multiple acoustic source detection; multiple acoustic source tracking; source separation; speech recording; time-delay of arrival measurement; time-frequency masking based random finite set particle filtering method; Atmospheric measurements; Hidden Markov models; Microphones; Particle measurements; Reverberation; Speech processing; Time of arrival estimation; Acoustic source tracking; particle filtering (PF); random finite set (RFS); room reverberation; time-delay of arrival;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    2329-9290
  • Type

    jour

  • DOI
    10.1109/TASLP.2015.2479041
  • Filename
    7268893