Title :
Robust Multi-Modal Group Action Recognition in Meetings from Disturbed Videos with the Asynchronous Hidden Markov Model
Author :
Al-Hames, Marc ; Lenz, Claus ; Reiter, Stephan ; Schenk, Joachim ; Wallhoff, Frank ; Rigoll, Gerhard
Author_Institution :
Tech. Univ. Munchen, Munchen
fDate :
Sept. 16 2007-Oct. 19 2007
Abstract :
The asynchronous hidden Markov model (AHMM) models the joint likelihood of two observation sequences, even if the streams are not synchronised. We explain this concept and how the model is trained by the EM algorithm. We then show how the AHMM can be applied to the analysis of group action events in meetings from both clear and disturbed data. The AHMM outperforms an early fusion HMM by 5.7% recognition rate (a rel. error reduction of 38.5%) for clear data. For occluded data, the improvement is in average 6.5% recognition rate (rel. error red. 40%). Thus asynchronity is a dominant factor in meeting analysis, even if the data is disturbed. The AHMM exploits this and is therefore much more robust against disturbances.
Keywords :
expectation-maximisation algorithm; hidden Markov models; video signal processing; asynchronous hidden Markov model; expectation-maximisation algorithm; meetings; multimedia communication; multimodal group action recognition; video signal processing; Cameras; Hidden Markov models; Legged locomotion; Man machine systems; Microphone arrays; Organizational aspects; Robustness; Signal processing algorithms; Speech analysis; Videos; Hidden Markov models; Meetings; Multimedia communication; Robustness; Video signal processing;
Conference_Titel :
Image Processing, 2007. ICIP 2007. IEEE International Conference on
Conference_Location :
San Antonio, TX
Print_ISBN :
978-1-4244-1437-6
Electronic_ISBN :
1522-4880
DOI :
10.1109/ICIP.2007.4379130