Title :
Visual speech recognition by recurrent neural networks
Author :
Rabi, Gihad ; Lu, Siwei
Author_Institution :
Dept. of Comput. Sci., Memorial Univ. of Newfoundland, St. John´´s, Nfld., Canada
Abstract :
A system for visual speech recognition is described in this paper. In the first phase of the system´s operation, time-varying visual speech patterns are obtained from a sequence of images. In the second phase, the system uses recurrent neural networks to classify the spatio-temporal pattern as one of the previously-trained words. By specifying a certain behavior when a recurrent network is presented with exemplar sequences, the network is trained with no more than feed-forward complexity. The network´s desired behavior is based on characterizing a given word by well-defined segments. Adaptive segmentation is employed to segment the training sequences of a given word
Keywords :
feature extraction; image recognition; image segmentation; image sequences; recurrent neural nets; speech recognition; adaptive segmentation; feed-forward complexity; image sequence; recurrent neural networks; spatio-temporal pattern; time-varying visual speech patterns; training sequences; visual speech recognition; Automatic speech recognition; Computer science; Crosstalk; Feedforward systems; Image segmentation; Mouth; Recurrent neural networks; Shape; Speech analysis; Speech recognition;
Conference_Titel :
Electrical and Computer Engineering, 1997. Engineering Innovation: Voyage of Discovery. IEEE 1997 Canadian Conference on
Conference_Location :
St. Johns, Nfld.
Print_ISBN :
0-7803-3716-6
DOI :
10.1109/CCECE.1997.614788