Title :
Audio constrained particle filter based visual tracking
Author :
Kilic, Volkan ; Barnard, Mark ; Wenwu Wang ; Kittler, Josef
Author_Institution :
Centre for Vision, Speech & Signal Process., Univ. of Surrey, Guildford, UK
Abstract :
We present a robust and efficient audio-visual (AV) approach to speaker tracking in a room environment. A challenging problem with visual tracking is to deal with occlusions (caused by the limited field of view of cameras or by other speakers). Another challenge is associated with the particle filtering (PF) algorithm, commonly used for visual tracking, which requires a large number of particles to ensure the distribution is well modelled. In this paper, we propose a new method of fusing audio into the PF based visual tracking. We use the direction of arrival angles (DOAs) of the audio sources to reshape the typical Gaussian noise distribution of particles in the propagation step and to weight the observation model in the measurement step. Experiments on AV16.3 datasets show the advantage of our proposed method over the baseline PF method for tracking occluded speakers with a significantly reduced number of particles.
Keywords :
Gaussian distribution; audio signal processing; direction-of-arrival estimation; particle filtering (numerical methods); speaker recognition; target tracking; AV approach; AV16.3 datasets; DOA; Gaussian noise distribution; audio constrained particle filter based visual tracking; audio fusion; audiovisual approach; baseline PF method; direction of arrival angles; measurement step; observation model; propagation step; room environment; speaker tracking; Atmospheric measurements; Cameras; Direction-of-arrival estimation; Particle measurements; Robustness; Signal processing algorithms; Visualization; DOAs; Particle filter; visual tracking;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6638334