DocumentCode
430846
Title
RBF neural network mouth tracking for audio-visual speech recognition system
Author
Hui, Lim Ee ; Seng, K.P. ; Tse, K.M.
Author_Institution
Sch. of Eng., Monash Univ., Malaysia
Volume
A
fYear
2004
fDate
21-24 Nov. 2004
Firstpage
84
Abstract
A great interest in the research of audio-visual speech recognition (AVSR) systems is driven by the increase in the number of multimedia applications that require robust speech recognition systems. The use of visual features in AVSR is justified by both the audio and visual modality of the speech generation and the need for features that are invariant to acoustic noise perturbation. The performance of the AVSR system relies on a robust set of visual features obtained from the accurate detection and tracking of the mouth region. Therefore the mouth tracking plays a major role in AVSR systems. This paper presents an improvement version of mouth tracking technique using radial basis function neural network (RBF NN) with its applications to AVSR systems. A modified extended Kalman filter (EKF) is used to adjust the parameters of the RBF NN. Simulation results have revealed good performance of the proposed method.
Keywords
Kalman filters; acoustic noise; audio-visual systems; nonlinear filters; radial basis function networks; speech recognition; RBF neural network; acoustic noise perturbation; audio-visual speech recognition system; extended Kalman filter; mouth tracking; radial basis function neural network; speech generation; Acoustic noise; Acoustic signal detection; Application software; Automatic speech recognition; Hidden Markov models; Humans; Mouth; Neural networks; Noise robustness; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
TENCON 2004. 2004 IEEE Region 10 Conference
Print_ISBN
0-7803-8560-8
Type
conf
DOI
10.1109/TENCON.2004.1414362
Filename
1414362
Link To Document