DocumentCode
3367506
Title
A real-time automatic lipreading system
Author
Wang, S.L. ; Lau, W.H. ; Leung, S.H. ; Yan, H.
Author_Institution
Dept. of Comput. Eng. & Inf. Technol., City Univ. of Hong Kong, China
Volume
2
fYear
2004
fDate
23-26 May 2004
Abstract
It´s well known that visual information such as lip shape and its movement can indicate what the speaker is talking about. In this paper, we present an automatic lipreading system solely using visual information for recognizing isolated English digits from 0 to 9. A parameter set of a 14-point ASM lip model is used to describe the outer lip contour. The inner mouth information such as the teeth region and the mouth opening are also extracted. With appropriate normalization, the feature vectors containing the normalized outer lip features, inner mouth features and also their first order derivatives are obtained for training the HMM word models. Experiments have been carried out to investigate the recognition performance using our visual feature set compared with other traditional visual feature representations. An accuracy of 93% for speaker dependent recognition and 84% for speaker independent recognition is achieved using our visual feature representation. A real-time automatic lipreading system has been successfully implemented on a 1.9-GHz PC.
Keywords
feature extraction; image colour analysis; image representation; real-time systems; speaker recognition; 1.9 GHz; English digit recognition; first order derivatives; image colour analysis; lip model; lip shape recognition; normalized outer lip feature vectors; outer lip contour; real time automatic lipreading system; speaker dependent recognition; speaker independent recognition; visual feature set representation; visual information; Data mining; Feature extraction; Hidden Markov models; Image segmentation; Mouth; Real time systems; Robustness; Shape; Speech recognition; Teeth;
fLanguage
English
Publisher
ieee
Conference_Titel
Circuits and Systems, 2004. ISCAS '04. Proceedings of the 2004 International Symposium on
Print_ISBN
0-7803-8251-X
Type
conf
DOI
10.1109/ISCAS.2004.1329218
Filename
1329218
Link To Document