• DocumentCode
    3632053
  • Title

    Information fusion techniques in Audio-Visual Speech Recognition

  • Author

    H. Karabalkan;H. Erdogan

  • Author_Institution
    Mühendislik ve Doğa Bilimleri Fakültesi, Sabancı Üniversitesi, Turkey
  • fYear
    2009
  • fDate
    4/1/2009 12:00:00 AM
  • Firstpage
    504
  • Lastpage
    507
  • Abstract
    It is well known that human speech perception relies on both audio and visual information. However, the physiology of the information fusion process in humans is still not well understood, which draws scientists' attention to information fusion for audio-visual speech recognition. In this work, a novel tandem hybrid approach is introduced for an efficient audio-visual speech recognition system, and the performance of the proposed technique is experimentally compared with the widely used Multiple Stream Hidden Markov Model (MSHMM) approach.
  • Keywords
    "Speech recognition","Hidden Markov models","Mel frequency cepstral coefficient","Discrete cosine transforms","Telecommunication standards","Streaming media","Humans","Linear discriminant analysis","Physiology","Gaussian processes"
  • Publisher
    IEEE
  • Conference_Titel
    Signal Processing and Communications Applications Conference, 2009. SIU 2009. IEEE 17th
  • ISSN
    2165-0608
  • Print_ISBN
    978-1-4244-4435-9
  • Type

    conf

  • DOI
    10.1109/SIU.2009.5136443
  • Filename
    5136443