• DocumentCode
    2332757
  • Title

    Reduced Complexity and Scaling for Asynchronous HMMs in a Bimodal Input Fusion Application

  • Author

    Al-Hames, Marc ; Rigoll, Gerhard

  • Author_Institution
    Inst. for Human-Machine Commun., Technische Univ. München
  • Volume
    5
  • fYear
    2006
  • fDate
    14-19 May 2006
  • Abstract
    The asynchronous hidden Markov model (AHMM) can model the joint likelihood of two observation sequences, even if the streams are not synchronised. Previously, this model has been applied to audio-visual recognition tasks. The main drawback of the concept is its rather high training and decoding complexity. In this work, we show how the complexity can be reduced significantly with advanced running indices for the calculations, while the AHMM characteristics and advantages are preserved. The improvement also allows a scaling procedure to keep numerical values in a reasonable range. In an experimental section, we compare the complexity of the original and the improved concept and validate the theoretical results. The model is then tested on a bimodal speech and gesture user input fusion task: compared to a late-fusion HMM, an improvement of more than 10% absolute in recognition performance has been achieved.
  • Keywords
    audio-visual systems; computational complexity; decoding; gesture recognition; hidden Markov models; image coding; speech coding; speech recognition; asynchronous HMM; asynchronous hidden Markov model; audio-visual recognition tasks; bimodal input fusion application; bimodal speech; decoding complexity; gesture user input fusion task; Automatic speech recognition; Computational modeling; Decoding; Hidden Markov models; Man machine systems; Speech analysis; Speech recognition; Streaming media; Switches; Testing;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
  • Conference_Location
    Toulouse
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0469-X
  • Type

    conf

  • DOI
    10.1109/ICASSP.2006.1661386
  • Filename
    1661386