DocumentCode :
3029484
Title :
Multi-modal fusion with particle filter for speaker localization and tracking
Author :
Heuer, Michael ; Al-Hamadi, Ayoub ; Michaelis, Bernd ; Wendemuth, Andreas
Author_Institution :
Inst. for Electron., Signal Process. & Commun., Otto-von-Guericke Univ. of Magdeburg, Magdeburg, Germany
fYear :
2011
fDate :
26-28 July 2011
Firstpage :
6450
Lastpage :
6453
Abstract :
This paper describes a methodology for fusing multimodal data meaningful together, in order to detect and track a speaker with a conventional sensor setup. We use Gaussian mixtures to combine the sensor information within a particle filter, such that a single speaker can be identified in the presence of multiple visual observations. The major advantages are design considerations that let the system perform in real time, while using an easily extensible framework. Besides, we highly reduce noise which gives us a more dependable prediction. Results illustrate the localization estimations in a two- and a three-person scenario.
Keywords :
particle filtering (numerical methods); signal denoising; speaker recognition; Gaussian mixtures; conventional sensor setup; localization estimations; multimodal data fusion; multimodal fusion; multiple visual observations; noise reduction; particle filter; speaker localization; speaker tracking; Computational modeling; Feature extraction; Image color analysis; Image segmentation; Particle filters; Skin; Streaming media; Computer Vision; Data Fusion; Pattern Recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia Technology (ICMT), 2011 International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-61284-771-9
Type :
conf
DOI :
10.1109/ICMT.2011.6002028
Filename :
6002028
Link To Document :
بازگشت