مرکز منطقه ای اطلاع رساني علوم و فناوري - Audio-visual speech recognition with a hybrid SVM-HMM system

DocumentCode :

698468

Title :

Audio-visual speech recognition with a hybrid SVM-HMM system

Author :

Gurban, Mihai ; Thiran, Jean-Philippe

Author_Institution :

Signal Process. Inst., Ecole Polytech. Fed. de Lausanne (EPFL), Lausanne, Switzerland

fYear :

2005

fDate :

4-8 Sept. 2005

Firstpage :

Lastpage :

Abstract :

Traditional speech recognition systems use Gaussian mixture models to obtain the likelihoods of individual phonemes, which are then used as state emission probabilities in hidden Markov models representing the words. In hybrid systems, the Gaussian mixtures are replaced by more discriminant classifiers, leading to an improved performance. Most of the time the classifiers used in such systems are neural networks. Support vector machines have also been used in one-modality audio or visual speech recognition, but never in a multimodal audio-visual system. We propose such a hybrid SVM-HMM speech recognizer, and we show how the multimodal approach leads to better performance than that obtained with any of the two modalities individually.

Keywords :

audio-visual systems; hidden Markov models; neural nets; signal classification; speech recognition; support vector machines; audio-visual speech recognition; discriminant classifiers; hidden Markov models; hybrid SVM-HMM system; individual phoneme likelihoods; neural networks; state emission probabilities; support vector machines; Accuracy; Hidden Markov models; Signal to noise ratio; Speech; Speech recognition; Support vector machines; Visualization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Signal Processing Conference, 2005 13th European

Conference_Location :

Antalya

Print_ISBN :

978-160-4238-21-1

Type :

conf

Filename :

7078053

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=698468