Title :
Review of the lip-reading recognition
Author :
Zeliang Zhang ; Wenliang Qu ; Fumei Liu
Author_Institution :
Inst. of Inf. Technol. & Media, Beihua Univ., Jilin, China
Abstract :
As an important component of the future HumanComputer Interface, Automatic Speech Recognition is designed for the purpose of realizing identification recognition and natural language comprehension by means of human voice. Speech recognition technology has acquired significant achievements with some successful popularity and applications. IBM´s ViaVoice system, for instance, has good performances when the vocabulary pool is small and when the noise is low. But its performance will be greatly degraded when used in real application environments. In future applications of the humancomputer interaction, such as in a car, at airport, or live interviews, higher requirements for robust systems will be needed, therefore we need to explore new ways. Proved highly effective by most researchers, the combination of visual features of lip motion with vocal features can raise the recognition rate of the automatic speech system, and make it more robust and more adaptable to real environments. This focuses on the recognition methods and speech and visual fusion algorithm, which aims to attract more researchers to be interested and concerned in this area of research.
Keywords :
human computer interaction; image classification; image fusion; natural language processing; speech recognition; IBM ViaVoice system; automatic speech system recognition rate; human voice; human-computer interface; identification recognition; lip motion; lip-reading recognition; natural language comprehension; speech-visual fusion algorithm; visual features; vocal features; Artificial neural networks; Hidden Markov models; Speech; Speech recognition; Training; Visualization; Vocabulary; ANN; HMM; fusion algorithm;
Conference_Titel :
Software Engineering and Service Science (ICSESS), 2014 5th IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4799-3278-8
DOI :
10.1109/ICSESS.2014.6933638