DocumentCode :
698468
Title :
Audio-visual speech recognition with a hybrid SVM-HMM system
Author :
Gurban, Mihai ; Thiran, Jean-Philippe
Author_Institution :
Signal Process. Inst., Ecole Polytech. Fed. de Lausanne (EPFL), Lausanne, Switzerland
fYear :
2005
fDate :
4-8 Sept. 2005
Firstpage :
1
Lastpage :
4
Abstract :
Traditional speech recognition systems use Gaussian mixture models to obtain the likelihoods of individual phonemes, which are then used as state emission probabilities in hidden Markov models representing the words. In hybrid systems, the Gaussian mixtures are replaced by more discriminant classifiers, leading to an improved performance. Most of the time the classifiers used in such systems are neural networks. Support vector machines have also been used in one-modality audio or visual speech recognition, but never in a multimodal audio-visual system. We propose such a hybrid SVM-HMM speech recognizer, and we show how the multimodal approach leads to better performance than that obtained with any of the two modalities individually.
Keywords :
audio-visual systems; hidden Markov models; neural nets; signal classification; speech recognition; support vector machines; audio-visual speech recognition; discriminant classifiers; hidden Markov models; hybrid SVM-HMM system; individual phoneme likelihoods; neural networks; state emission probabilities; support vector machines; Accuracy; Hidden Markov models; Signal to noise ratio; Speech; Speech recognition; Support vector machines; Visualization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference, 2005 13th European
Conference_Location :
Antalya
Print_ISBN :
978-160-4238-21-1
Type :
conf
Filename :
7078053
Link To Document :
بازگشت