DocumentCode
590666
Title
GIF-LR:GA-based informative feature for lipreading
Author
Ukai, Naoya ; Seko, T. ; Tamura, Shinji ; Hayamizu, Satoru
Author_Institution
Dept. of Inf. Sci., Gifu Univ., Gifu, Japan
fYear
2012
fDate
3-6 Dec. 2012
Firstpage
1
Lastpage
4
Abstract
In this paper, we propose a general and discriminative feature “GIF” (GA-based Informative Feature), and apply the feature to lipreading (visual speech recognition). The feature extraction method consists of two transforms, that convert an input vector to GIF for recognition. The transforms can be computed using training data and Genetic Algorithm (GA). For lipreading, we extract a fundamental feature as an input vector from an image; the vector consists of intensity values at all the pixels in an input lip image, which are enumerated from left-top to right-bottom. Recognition experiments of continuous digit utterances were conducted using an audio-visual corpus including more than 268,000 lip images. The recognition results show that the GIF-based method is better than the baseline method using eigenlip features.
Keywords
feature extraction; genetic algorithms; image recognition; speech recognition; GA based informative feature; GIF-LR; discriminative feature; eigenlip feature; feature extraction method; genetic algorithm; lipreading; training data; visual speech recognition; Accuracy; Feature extraction; Genetic algorithms; Hidden Markov models; Speech recognition; Vectors; Visualization;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), 2012 Asia-Pacific
Conference_Location
Hollywood, CA
Print_ISBN
978-1-4673-4863-8
Type
conf
Filename
6411813
Link To Document