DocumentCode
1083496
Title
Binaural Tracking of Multiple Moving Sources
Author
Roman, Nicoleta ; Wang, DeLiang
Author_Institution
Dept. of Math., Ohio State Univ., Lima, OH
Volume
16
Issue
4
fYear
2008
fDate
5/1/2008 12:00:00 AM
Firstpage
728
Lastpage
739
Abstract
This paper addresses the problem of tracking multiple moving sources using binaural input. We observe that binaural cues are strongly correlated with source locations in time-frequency regions dominated by only one source. Based on this observation, we propose a novel tracking algorithm that integrates probabilities across reliable frequency channels in order to produce a likelihood function in the target space, which describes the azimuths of all active sources at a particular time frame. Finally, a hidden Markov model (HMM) is employed to form continuous tracks and automatically detect the number of active sources across time. Results are presented for up to three moving talkers in anechoic conditions. A comparison shows that our HMM model outperforms a Kalman filter-based approach in tracking active sources across time. Our study represents a first step in addressing auditory scene analysis with moving sound sources.
Keywords
Kalman filters; hidden Markov models; tracking filters; Kalman filter; active source tracking; anechoic condition; auditory scene analysis; binaural cues; binaural tracking; hidden Markov model; likelihood function; moving sound source; moving talker; multiple moving source tracking; reliable frequency channel; tracking algorithm; Binaural processing; hidden Markov model (HMM); moving source tracking; multisource tracking;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2008.918978
Filename
4457928
Link To Document