DocumentCode
2018484
Title
Audio visual speech recognition based on multi-stream DBN models with Articulatory Features
Author
Jiang, Dong-mei ; Wu, Peng ; Wang, Feng-na ; Sahli, Hichem ; Verhelst, Werner
Author_Institution
Sch. of Comput., Northwestern Polytech. Univ., Xi´´an, China
fYear
2010
fDate
Nov. 29 2010-Dec. 3 2010
Firstpage
190
Lastpage
193
Abstract
We present a multi-stream Dynamic Bayesian Network model with Articulatory Features (AF_AV_DBN) for audio visual speech recognition. Conditional probability distributions of the nodes are defined considering the asynchronies between the articulatory features (AFs). Speech recognition experiments are carried out on an audio visual connected digit database. Results show that comparing with the state synchronous DBN model (SS_DBN) and state asynchronous DBN model (SA_DBN), when the asynchrony constraint between the AFs is appropriately set, the AF_AV_DBN model gets the highest recognition rates, with average recognition rate improved to 89.38% from 87.02% of SS_DBN and 88.32% of SA_DBN. Moreover, the audio visual multi-stream AF_AV_DBN model greatly improves the robustness of the audio only AF_A_DBN model, for example, under the noise of -10dB, the recognition rate is improved from 20.75% to 76.24%.
Keywords
audio-visual systems; belief networks; feature extraction; speech recognition; statistical distributions; AFAVDBN model; articulatory feature; asynchrony constraint; audio visual connected digit database; audio visual speech recognition; conditional probability distribution; multistream DBN model; multistream dynamic Bayesian network model; recognition rate; state asynchronous DBN model; Hidden Markov models; Noise; Speech; Speech processing; Speech recognition; Tongue; Visualization; DBN; articulatory feature; audio-visual; speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
Conference_Location
Tainan
Print_ISBN
978-1-4244-6244-5
Type
conf
DOI
10.1109/ISCSLP.2010.5684915
Filename
5684915
Link To Document