Title :
Nonlinear manifold learning for visual speech recognition
Author :
Bregler, Christoph ; Omohundro, Stephen M.
Author_Institution :
Div. of Comput. Sci., California Univ., Berkeley, CA, USA
Abstract :
A technique for representing and learning smooth nonlinear manifolds is presented and applied to several lip reading tasks. Given a set of points drawn from a smooth manifold in an abstract feature space, the technique is capable of determining the structure of the surface and of finding the closest manifold point to a given query point. We use this technique to learn the “space of lips” in a visual speech recognition task. The learned manifold is used for tracking and extracting the lips, for interpolating between frames in an image sequence and for providing features for recognition. We describe a system based on hidden Markov models and this learned lip manifold that significantly improves the performance of acoustic speech recognizers in degraded environments. We also present preliminary results on a purely visual lip reader
Keywords :
feature extraction; hidden Markov models; speech recognition; abstract feature space; acoustic speech recognizers; degraded environments; hidden Markov models; image sequence; learned lip manifold; learned manifold; lip extraction; lip reading task; nonlinear manifold learning; purely visual lip reader; query point; smooth nonlinear manifolds; tracking; visual speech recognition; Buildings; Computer science; Degradation; Hidden Markov models; Image recognition; Lips; Loudspeakers; Machine vision; National electric code; Speech recognition;
Conference_Titel :
Computer Vision, 1995. Proceedings., Fifth International Conference on
Conference_Location :
Cambridge, MA
Print_ISBN :
0-8186-7042-8
DOI :
10.1109/ICCV.1995.466899