Title :
Speaker dependent visual speech recognition using Extended Curvature Gabor filters
Author :
Jeongwoo Ju ; Heechul Jung ; Junmo Kim
Author_Institution :
Div. of Future Vehicle, KAIST, Daejeon, South Korea
Abstract :
Performance of a speech recognition system often degrades severely under low SNR environment. To overcome this difficulty, the visual signal is also considered as an additional aid these days. In this paper, we address speaker dependent visual speech recognition problem using Extended Curvature Gabor (ECG) wavelet. First, lip image sequences are filtered using the ECG, because the variation of the filter response well represents the lip movement. Next, the distance between the output and training data is calculated using the Multi Dimensional Dynamic Time Warping (MDDTW) with new cost matrix. Finally, the lip sequences are classified into the corresponding utterance. In this process, the parameters of ECG must be selected appropriately, where we compare a simple greedy selection method and selection scheme based on AdaBoost.
Keywords :
Gabor filters; greedy algorithms; image sequences; learning (artificial intelligence); matrix algebra; speaker recognition; wavelet transforms; AdaBoost; ECG; MDDTW; cost matrix; extended curvature Gabor filters; filter response; lip image sequences; multi dimensional dynamic time warping; selection scheme; simple greedy selection method; speaker dependent visual speech recognition system; visual signal; Electrocardiography; Gabor filters; Image sequences; Speech; Speech recognition; Training data; Visualization;
Conference_Titel :
Consumer Electronics (ICCE), 2013 IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4673-1361-2
DOI :
10.1109/ICCE.2013.6486907