DocumentCode :
2913311
Title :
Towards a practical lipreading system
Author :
Zhou, Ziheng ; Zhao, Guoying ; Pietikäinen, Matti
Author_Institution :
Comput. Sci. & Eng. Lab., Univ. of Oulu, Oulu, Finland
fYear :
2011
fDate :
20-25 June 2011
Firstpage :
137
Lastpage :
144
Abstract :
A practical lipreading system can be considered either subject-dependent (SD) or subject-independent (SI). An SD system is user-specific, i.e., customized for a particular user, while an SI system has to cope with a large number of users. These two types of systems pose different challenges and have to be treated differently. In this paper, we propose a simple deterministic model to tackle the problem. The model first seeks a low-dimensional manifold where visual features extracted from the frames of a video can be projected onto a continuous deterministic curve embedded in a path graph. Moreover, it can map arbitrary points on the curve back into the image space, making it suitable for temporal interpolation. Based on the model, we develop two separate strategies for SD and SI lipreading. The former is turned into a simple curve-matching problem, while for the latter we propose a video-normalization scheme to improve the system developed by Zhao et al. We evaluated our system on the OuluVS database and achieved recognition rates more than 20% higher than those reported by Zhao et al. in both SD and SI testing scenarios.
Keywords :
feature extraction; graph theory; image recognition; image sequences; SI system; curve matching problem; image space; lipreading system; path graph; pose variant challenge; temporal interpolation; user specific system; video frame; video normalization scheme; visual feature extraction; Feature extraction; Hidden Markov models; Mouth; Silicon; Speech; Speech recognition; Visualization;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on
Conference_Location :
Providence, RI
ISSN :
1063-6919
Print_ISBN :
978-1-4577-0394-2
Type :
conf
DOI :
10.1109/CVPR.2011.5995345
Filename :
5995345