DocumentCode
423786
Title
A robust hierarchical lip tracking approach for lipreading and audio visual speech recognition
Author
Xie, Lei ; Cai, Xiu-Li ; Fu, Zhong-Hwa ; Zhao, Rong-chun ; Jiang, Dong-mei
Author_Institution
Sch. of Comput. Sci., Northwestern Polytech Univ., Xi´´an, China
Volume
6
fYear
2004
fDate
26-29 Aug. 2004
Firstpage
3620
Abstract
This paper presents a robust hierarchical lip tracking approach (RoHiLTA) for lip-reading and audio visual speech recognition (AVSR) applications. Lip regions of interest are subtly detected by motion and facial structure information. Improvements are made on active shape models (ASMs) for extracting lip contours more accurately and efficiently from video sequences of a speaker´s talking face in natural lighting conditions and without particular make-ups. Local and global ASM search algorithms are both improved by introducing color information, 2D mouth corner match, and robust estimation. For noise-free features, localization errors are automatically corrected by an interpolating scheme. A fast implementation of the hierarchical approach is also proposed. Extensive experiments show that the improved ASM can effectively reduce the lip locating errors. The fast implementation of RoHiLTA can consistently achieve superior performance to conventional ASMs in lip tracking tasks, and then can be effectively integrated in lip-reading and AVSR systems.
Keywords
audio-visual systems; feature extraction; handicapped aids; image colour analysis; image motion analysis; image sequences; interpolation; object detection; speech recognition; active shape models; audio visual speech recognition; facial structure information detection; interpolating scheme; localization error correction; robust hierarchical lip tracking; search algorithms; video sequences; Active shape model; Colored noise; Data mining; Face detection; Motion detection; Mouth; Noise robustness; Noise shaping; Speech recognition; Video sequences;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on
Print_ISBN
0-7803-8403-2
Type
conf
DOI
10.1109/ICMLC.2004.1380425
Filename
1380425
Link To Document