DocumentCode :
3429179
Title :
Robust geometrical-based lip-reading using Hidden Markov models
Author :
Ibrahim, M.Z. ; Mulvaney, D.J.
Author_Institution :
Sch. of Electron., Electr. & Syst. Eng., Loughborough Univ., Loughborough, UK
fYear :
2013
fDate :
1-4 July 2013
Firstpage :
2011
Lastpage :
2016
Abstract :
Lip reading is a process used to recognize speech from the viewed physical movements of the lips. In this paper, we present a new automatic lip-reading system that uses geometrical information extracted from video sequences in the classification of dynamic lip movements and implemented in four variants of Hidden Markov Models. In the recognition of the English digits 0 to 9 as spoken by the subjects available in the CUAVE database, the proposed system is able to produce a word recognition performance of up to 68%, a result better than that obtained using a conventional appearance-based Discrete Cosine Transform technique. The two approaches are also compared when operating under simulated changes in environment conditions that arise from head movements and alterations in image illumination. The performance of the appearance-based approach was adversely affected by such rotational and brightness changes, yet the performance of the geometrical-based method remained consistent, demonstrating its potential to be effective as part of a multimodal speech recognition system for use in noisy environments.
Keywords :
computational geometry; discrete cosine transforms; gesture recognition; handicapped aids; hidden Markov models; image classification; image sequences; natural language processing; speech recognition; video signal processing; CUAVE database; English digits; appearance-based discrete cosine transform technique; automatic lip-reading system; brightness changes; dynamic lip movement classification; geometrical information extraction; head movements; hidden Markov models; image illumination; multimodal speech recognition system; people-with-hearing impairments; robust geometrical-based lip-reading; rotational changes; video sequences; word recognition performance; Brightness; Discrete cosine transforms; Feature extraction; Geometry; Head; Hidden Markov models; Visualization; OpenCV; convex hull; discrete cosine transform; hidden markov models; lip geometry; lip reading; skin detection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
EUROCON, 2013 IEEE
Conference_Location :
Zagreb
Print_ISBN :
978-1-4673-2230-0
Type :
conf
DOI :
10.1109/EUROCON.2013.6625256
Filename :
6625256
Link To Document :
بازگشت