DocumentCode :
3632053
Title :
Information fusion techniques in Audio-Visual Speech Recognition
Author :
H. Karabalkan;H. Erdogan
Author_Institution :
M?hendislik ve Do?a Bilimleri Fak?ltesi, Sabanc? ?niversitesi, Turkey
fYear :
2009
fDate :
4/1/2009 12:00:00 AM
Firstpage :
504
Lastpage :
507
Abstract :
It is well known that human perception of speech relies both on audio and visual information. However, the physiology of information fusion process in humans is still indefinite which attracts scientists´ attention to information fusion process for audio-visual speech recognition. In this work, a novel tandem hybrid approach is introduced for an efficient audio-visual speech recognition system and the performance of the proposed technique is experimentally compared with the widely used Multiple Stream Hidden Markov Model (MSHMM) approach.
Keywords :
"Speech recognition","Hidden Markov models","Mel frequency cepstral coefficient","Discrete cosine transforms","Telecommunication standards","Streaming media","Humans","Linear discriminant analysis","Physiology","Gaussian processes"
Publisher :
ieee
Conference_Titel :
Signal Processing and Communications Applications Conference, 2009. SIU 2009. IEEE 17th
ISSN :
2165-0608
Print_ISBN :
978-1-4244-4435-9
Type :
conf
DOI :
10.1109/SIU.2009.5136443
Filename :
5136443
Link To Document :
بازگشت