Title :
A dictionary learning approach to tracking
Author :
Barnard, Mark ; Wang, Wenwu ; Kittler, Josef ; Naqvi, Syed Mohsen ; Chambers, Jonathon A.
Author_Institution :
Centre for Vision, Speech & Signal Process., Univ. of Surrey, Guildford, UK
Abstract :
The problem of tracking people using multiple cameras is of much current interest as a means of providing cues for audio-visual blind source separation in dynamic environments. Here we investigate the use of one of the current state-of-the-art techniques in object recognition combined with one of the most popular methods of modelling object motion, particle filters, for tracking people. The dictionary learning or Bag-of-Words approach to object recognition has proved to be very effective in recent years, as shown in a number of large comparisons such as the PASCAL Visual Object recognition Challenge (VOC). In this paper we use this proven object recognition method within the framework of a particle filter. This provides a more accurate and robust tracking of people in a multiple camera environment. We also demonstrate that the dictionary learning approach can provide a principled method for the fusion of multiple features.
Keywords :
audio-visual systems; blind source separation; object recognition; particle filtering (numerical methods); PASCAL visual object recognition challenge; audio-visual blind source separation; bag-of-words approach; dictionary learning; dynamic environments; multiple cameras; multiple features; object motion; particle filters; people tracking; Adaptation models; Dictionaries; Histograms; Support vector machines; Training; Training data; Visualization; Dictionary Learning; Tracking;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6288049