DocumentCode
1294736
Title
An Acoustic Source Localization and Tracking Framework Using Particle Filtering and Information Theory
Author
Talantzis, Fotios
Author_Institution
Autonomic Comput. Group, Athens Inf. Technol. Inst., Athens, Greece
Volume
18
Issue
7
fYear
2010
Firstpage
1806
Lastpage
1817
Abstract
The problem of detecting the location of an active acoustic source in an enclosure remains subject to a series of difficulties. Algorithms operate repeatedly on small frames of data from microphone recordings and provide estimates of the current source location. In a typical room, the quality of these recordings is affected by noise and reverberation. Additionally, the presence of silence gaps in speech signals and possible competing speakers can reduce tracking accuracy further. We discuss a novel localization and tracking framework that is based on particle filtering. This is driven by detection methods based on information theory that remain robust under reverberant and noisy environments. Integrating a second particle filter allows the system to track interchanging acoustic sources that reside far apart. A further extension involves the integration of a voice activity detection scheme that uses the same detection measures and deals with human-speech gaps. Performance is first examined using simulations that parameterize results according to environmental variables like reverberation and system geometry. The system is then used in a real-world scenario with data from multi-person meetings. Results indicate that the proposed framework outperforms all systems used for comparison in this work while remaining adequately robust in the examined environments.
Keywords
information theory; particle filtering (numerical methods); radio tracking; acoustic source localization; information theory; particle filtering; system geometry; tracking framework; Microphones; Noise; Position measurement; Reverberation; Robustness; Speech; Acoustic arrays; acoustic tracking; information theory;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2010.2052248
Filename
5547560
Link To Document