Regional Information Center for Science and Technology (RICeST) - Robust geometrical-based lip-reading using Hidden Markov models

DocumentCode :

3429179

Title :

Robust geometrical-based lip-reading using Hidden Markov models

Author :

Ibrahim, M.Z. ; Mulvaney, D.J.

Author_Institution :

Sch. of Electron., Electr. & Syst. Eng., Loughborough Univ., Loughborough, UK

fYear :

2013

fDate :

1-4 July 2013

Firstpage :

2011

Lastpage :

2016

Abstract :

Lip reading is the process of recognizing speech from the visible movements of the lips. In this paper, we present a new automatic lip-reading system that classifies dynamic lip movements using geometrical information extracted from video sequences, implemented with four variants of Hidden Markov Models. In recognizing the English digits 0 to 9 as spoken by the subjects in the CUAVE database, the proposed system achieves a word recognition performance of up to 68%, better than that obtained using a conventional appearance-based Discrete Cosine Transform technique. The two approaches are also compared under simulated changes in environmental conditions arising from head movements and alterations in image illumination. The performance of the appearance-based approach was adversely affected by such rotational and brightness changes, whereas the performance of the geometrical-based method remained consistent, demonstrating its potential as part of a multimodal speech recognition system for use in noisy environments.
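The robustness claim in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the record does not state which geometrical features the paper uses, so the convex-hull area and perimeter below, the helper names (`convex_hull`, `hull_features`, `rotate`) and the lip-contour coordinates are all assumptions chosen for illustration. The sketch shows only that features computed from contour coordinates do not change under an in-plane rotation of the points. They also do not depend on pixel intensities, provided the detected contour itself is unchanged.

```python
import math

def convex_hull(points):
    """Andrew's monotone chain; returns hull vertices in counter-clockwise order."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts

    def cross(o, a, b):
        # z-component of (a - o) x (b - o); positive for a counter-clockwise turn
        return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return lower[:-1] + upper[:-1]

def hull_features(points):
    """Return (area, perimeter) of the convex hull of the given points."""
    hull = convex_hull(points)
    n = len(hull)
    twice_area = 0.0
    perimeter = 0.0
    for i in range(n):
        x1, y1 = hull[i]
        x2, y2 = hull[(i + 1) % n]
        twice_area += x1 * y2 - x2 * y1          # shoelace formula term
        perimeter += math.hypot(x2 - x1, y2 - y1)
    return abs(twice_area) / 2.0, perimeter

def rotate(points, degrees):
    """Rotate points about the origin, simulating an in-plane head rotation."""
    t = math.radians(degrees)
    c, s = math.cos(t), math.sin(t)
    return [(x * c - y * s, x * s + y * c) for x, y in points]

# Hypothetical lip-contour points in pixels (not taken from the CUAVE database):
# four outline points plus one interior point that the hull discards.
lip = [(0.0, 0.0), (20.0, 8.0), (40.0, 0.0), (20.0, -10.0), (20.0, 0.0)]

area, perimeter = hull_features(lip)
rot_area, rot_perimeter = hull_features(rotate(lip, 15))

print(f"original: area={area:.3f}, perimeter={perimeter:.3f}")
print(f"rotated:  area={rot_area:.3f}, perimeter={rot_perimeter:.3f}")
```

The two printed lines should agree up to floating-point rounding. An appearance-based feature vector, such as DCT coefficients of the mouth region, is computed from pixel values and so would generally change when the image is rotated or brightened.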

Keywords :

computational geometry; discrete cosine transforms; gesture recognition; handicapped aids; hidden Markov models; image classification; image sequences; natural language processing; speech recognition; video signal processing; CUAVE database; English digits; appearance-based discrete cosine transform technique; automatic lip-reading system; brightness changes; dynamic lip movement classification; geometrical information extraction; head movements; hidden Markov models; image illumination; multimodal speech recognition system; people-with-hearing impairments; robust geometrical-based lip-reading; rotational changes; video sequences; word recognition performance; Brightness; Discrete cosine transforms; Feature extraction; Geometry; Head; Hidden Markov models; Visualization; OpenCV; convex hull; discrete cosine transform; hidden markov models; lip geometry; lip reading; skin detection;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

EUROCON, 2013 IEEE

Conference_Location :

Zagreb

Print_ISBN :

978-1-4673-2230-0

Type :

conf

DOI :

10.1109/EUROCON.2013.6625256

Filename :

6625256

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3429179