Title :
GIF-LR:GA-based informative feature for lipreading
Author :
Ukai, Naoya ; Seko, T. ; Tamura, Shinji ; Hayamizu, Satoru
Author_Institution :
Dept. of Inf. Sci., Gifu Univ., Gifu, Japan
Abstract :
In this paper, we propose a general and discriminative feature “GIF” (GA-based Informative Feature), and apply the feature to lipreading (visual speech recognition). The feature extraction method consists of two transforms, that convert an input vector to GIF for recognition. The transforms can be computed using training data and Genetic Algorithm (GA). For lipreading, we extract a fundamental feature as an input vector from an image; the vector consists of intensity values at all the pixels in an input lip image, which are enumerated from left-top to right-bottom. Recognition experiments of continuous digit utterances were conducted using an audio-visual corpus including more than 268,000 lip images. The recognition results show that the GIF-based method is better than the baseline method using eigenlip features.
Keywords :
feature extraction; genetic algorithms; image recognition; speech recognition; GA based informative feature; GIF-LR; discriminative feature; eigenlip feature; feature extraction method; genetic algorithm; lipreading; training data; visual speech recognition; Accuracy; Feature extraction; Genetic algorithms; Hidden Markov models; Speech recognition; Vectors; Visualization;
Conference_Titel :
Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), 2012 Asia-Pacific
Conference_Location :
Hollywood, CA
Print_ISBN :
978-1-4673-4863-8