DocumentCode :
590666
Title :
GIF-LR:GA-based informative feature for lipreading
Author :
Ukai, Naoya ; Seko, T. ; Tamura, Shinji ; Hayamizu, Satoru
Author_Institution :
Dept. of Inf. Sci., Gifu Univ., Gifu, Japan
fYear :
2012
fDate :
3-6 Dec. 2012
Firstpage :
1
Lastpage :
4
Abstract :
In this paper, we propose a general and discriminative feature “GIF” (GA-based Informative Feature), and apply the feature to lipreading (visual speech recognition). The feature extraction method consists of two transforms, that convert an input vector to GIF for recognition. The transforms can be computed using training data and Genetic Algorithm (GA). For lipreading, we extract a fundamental feature as an input vector from an image; the vector consists of intensity values at all the pixels in an input lip image, which are enumerated from left-top to right-bottom. Recognition experiments of continuous digit utterances were conducted using an audio-visual corpus including more than 268,000 lip images. The recognition results show that the GIF-based method is better than the baseline method using eigenlip features.
Keywords :
feature extraction; genetic algorithms; image recognition; speech recognition; GA based informative feature; GIF-LR; discriminative feature; eigenlip feature; feature extraction method; genetic algorithm; lipreading; training data; visual speech recognition; Accuracy; Feature extraction; Genetic algorithms; Hidden Markov models; Speech recognition; Vectors; Visualization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), 2012 Asia-Pacific
Conference_Location :
Hollywood, CA
Print_ISBN :
978-1-4673-4863-8
Type :
conf
Filename :
6411813
Link To Document :
بازگشت