DocumentCode :
3097249
Title :
Experimental Performance Assessment of a Particle Filter with Voice Activity Data Fusion for Acoustic Speaker Tracking
Author :
Lehmann, Eric A. ; Johansson, Anders M.
Author_Institution :
Western Australia Telecommun. Res. Inst., Perth, WA
fYear :
2006
fDate :
7-9 June 2006
Firstpage :
126
Lastpage :
129
Abstract :
The problem of acoustic source localization and tracking (ASLT) in reverberant environments by means of a microphone array constitutes a challenging task from many viewpoints. One of the main issues when considering real-world situations involving human speakers is the presence of silence gaps in the speech, which can easily send the tracking algorithm off-track, even in practical environments with low to moderate noise and reverberation levels. This work is concerned with an implementation of the ASLT algorithm proposed in E. Lehmann et al., which circumvents this problem by integrating measurements from a voice activity detector (VAD) within the tracking algorithm framework. The tracking performance of this method is tested experimentally using audio data recorded in a real reverberant room. To this purpose, we describe a quick and efficient way of determining the ground-truth speaker location versus time, an operation that is not always easy to perform. The experimental results confirm the improved robustness of the method presented in E. Lehmann et al., (compared to a previously proposed non-VAD ASLT algorithm) when tracking sources emitting real-world speech signals, which typically involve significant silence gaps between utterances
Keywords :
acoustic signal processing; filtering theory; sensor fusion; speech processing; ASLT; VAD; acoustic source localization-tracking; acoustic speaker tracking; data fusion; microphone array; particle filter performance assessment; real-world speech signals; reverberant environments; voice activity detector; Acoustic arrays; Humans; Loudspeakers; Microphone arrays; Noise level; Particle filters; Particle tracking; Reverberation; Speech enhancement; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Symposium, 2006. NORSIG 2006. Proceedings of the 7th Nordic
Conference_Location :
Rejkjavik
Print_ISBN :
1-4244-0412-6
Electronic_ISBN :
1-4244-0413-4
Type :
conf
DOI :
10.1109/NORSIG.2006.275293
Filename :
4052288
Link To Document :
بازگشت