• DocumentCode
    1083496
  • Title

    Binaural Tracking of Multiple Moving Sources

  • Author

    Roman, Nicoleta ; Wang, DeLiang

  • Author_Institution
    Dept. of Math., Ohio State Univ., Lima, OH
  • Volume
    16
  • Issue
    4
  • fYear
    2008
  • fDate
    5/1/2008 12:00:00 AM
  • Firstpage
    728
  • Lastpage
    739
  • Abstract
    This paper addresses the problem of tracking multiple moving sources using binaural input. We observe that binaural cues are strongly correlated with source locations in time-frequency regions dominated by only one source. Based on this observation, we propose a novel tracking algorithm that integrates probabilities across reliable frequency channels in order to produce a likelihood function in the target space, which describes the azimuths of all active sources at a particular time frame. Finally, a hidden Markov model (HMM) is employed to form continuous tracks and automatically detect the number of active sources across time. Results are presented for up to three moving talkers in anechoic conditions. A comparison shows that our HMM model outperforms a Kalman filter-based approach in tracking active sources across time. Our study represents a first step in addressing auditory scene analysis with moving sound sources.
  • Keywords
    Kalman filters; hidden Markov models; tracking filters; Kalman filter; active source tracking; anechoic condition; auditory scene analysis; binaural cues; binaural tracking; hidden Markov model; likelihood function; moving sound source; moving talker; multiple moving source tracking; reliable frequency channel; tracking algorithm; Binaural processing; hidden Markov model (HMM); moving source tracking; multisource tracking;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2008.918978
  • Filename
    4457928