• DocumentCode
    2233868
  • Title

    A Multi-Modal Mixed-State Dynamic Bayesian Network for Robust Meeting Event Recognition from Disturbed Data

  • Author

    Al-Hames, Marc ; Rigoll, Gerhard

  • Author_Institution
    Inst. for Human-Machine Commun., Technische Univ. Munchen
  • fYear
    2005
  • fDate
    6-6 July 2005
  • Firstpage
    45
  • Lastpage
    48
  • Abstract
    In this work we present a novel multi-modal mixed-state dynamic Bayesian network (DBN) for robust meeting event classification. The model uses information from lapel microphones, a microphone array and visual information to structure meetings into segments. Within the DBN a multi-stream hidden Markov model (HMM) is coupled with a linear dynamical system (LDS) to compensate disturbances in the data. Thereby the HMM is used as driving input for the LDS. The model can handle noise and occlusions in all channels. Experimental results on real meeting data show that the new model is highly preferable to all single-stream approaches. Compared to a baseline multi-modal early fusion HMM, the new DBN is more than 2.5%, respectively 1.5% better for clear and disturbed data, this corresponds to a relative error reduction of 17%, respectively 9%
  • Keywords
    belief networks; hidden Markov models; image classification; microphone arrays; DBN; HMM; LDS; channel occlusion; data distribution; dynamic Bayesian network; linear dynamical system; meeting event classification; microphone array; multimodal mixed-state network; multistream hidden Markov model; visual information; Ambient intelligence; Bayesian methods; Cameras; Hidden Markov models; Legged locomotion; Man machine systems; Microphone arrays; Robustness; Speech analysis; Videos;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo, 2005. ICME 2005. IEEE International Conference on
  • Conference_Location
    Amsterdam
  • Print_ISBN
    0-7803-9331-7
  • Type

    conf

  • DOI
    10.1109/ICME.2005.1521356
  • Filename
    1521356