مرکز منطقه ای اطلاع رساني علوم و فناوري - Consideration of Lombard effect for speechreading

DocumentCode :

1835413

Title :

Consideration of Lombard effect for speechreading

Author :

Huang, Fu Jie ; Chen, Tsuhan

Author_Institution :

Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA

fYear :

2001

fDate :

2001

Firstpage :

613

Lastpage :

618

Abstract :

We propose a method for integrating audio and visual information to enhance speech recognition in adverse environments. We train the audio hidden Markov model and the visual hidden Markov model separately, and then use a Viterbi algorithm to decode both channels in parallel. The decoding process is asynchronous between the two channels to capture the asynchronous nature of audio and visual speech. We test the proposed method using speech corrupted by various types of noise and speech with the Lombard effect

Keywords :

Viterbi decoding; acoustic noise; audio coding; hidden Markov models; speech recognition; video coding; Lombard effect; adverse environments; asynchronous Viterbi decoding; enhanced speech recognition; hidden Markov model; integrated audio-visual information; lip-reading; speechreading; Automatic speech recognition; Background noise; Decoding; Degradation; Hidden Markov models; Speech processing; Speech recognition; State-space methods; Testing; Viterbi algorithm;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Multimedia Signal Processing, 2001 IEEE Fourth Workshop on

Conference_Location :

Cannes

Print_ISBN :

0-7803-7025-2

Type :

conf

DOI :

10.1109/MMSP.2001.962800

Filename :

962800

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1835413