DocumentCode :
3324576
Title :
Nonlinear manifold learning for visual speech recognition
Author :
Bregler, Christoph ; Omohundro, Stephen M.
Author_Institution :
Div. of Comput. Sci., California Univ., Berkeley, CA, USA
fYear :
1995
fDate :
20-23 Jun 1995
Firstpage :
494
Lastpage :
499
Abstract :
A technique for representing and learning smooth nonlinear manifolds is presented and applied to several lip reading tasks. Given a set of points drawn from a smooth manifold in an abstract feature space, the technique is capable of determining the structure of the surface and of finding the closest manifold point to a given query point. We use this technique to learn the “space of lips” in a visual speech recognition task. The learned manifold is used for tracking and extracting the lips, for interpolating between frames in an image sequence and for providing features for recognition. We describe a system based on hidden Markov models and this learned lip manifold that significantly improves the performance of acoustic speech recognizers in degraded environments. We also present preliminary results on a purely visual lip reader
Keywords :
feature extraction; hidden Markov models; speech recognition; abstract feature space; acoustic speech recognizers; degraded environments; hidden Markov models; image sequence; learned lip manifold; learned manifold; lip extraction; lip reading task; nonlinear manifold learning; purely visual lip reader; query point; smooth nonlinear manifolds; tracking; visual speech recognition; Buildings; Computer science; Degradation; Hidden Markov models; Image recognition; Lips; Loudspeakers; Machine vision; National electric code; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision, 1995. Proceedings., Fifth International Conference on
Conference_Location :
Cambridge, MA
Print_ISBN :
0-8186-7042-8
Type :
conf
DOI :
10.1109/ICCV.1995.466899
Filename :
466899
Link To Document :
بازگشت