مرکز منطقه ای اطلاع رساني علوم و فناوري - GIF-LR:GA-based informative feature for lipreading

DocumentCode :

590666

Title :

GIF-LR:GA-based informative feature for lipreading

Author :

Ukai, Naoya ; Seko, T. ; Tamura, Shinji ; Hayamizu, Satoru

Author_Institution :

Dept. of Inf. Sci., Gifu Univ., Gifu, Japan

fYear :

2012

fDate :

3-6 Dec. 2012

Firstpage :

Lastpage :

Abstract :

In this paper, we propose a general and discriminative feature “GIF” (GA-based Informative Feature), and apply the feature to lipreading (visual speech recognition). The feature extraction method consists of two transforms, that convert an input vector to GIF for recognition. The transforms can be computed using training data and Genetic Algorithm (GA). For lipreading, we extract a fundamental feature as an input vector from an image; the vector consists of intensity values at all the pixels in an input lip image, which are enumerated from left-top to right-bottom. Recognition experiments of continuous digit utterances were conducted using an audio-visual corpus including more than 268,000 lip images. The recognition results show that the GIF-based method is better than the baseline method using eigenlip features.

Keywords :

feature extraction; genetic algorithms; image recognition; speech recognition; GA based informative feature; GIF-LR; discriminative feature; eigenlip feature; feature extraction method; genetic algorithm; lipreading; training data; visual speech recognition; Accuracy; Feature extraction; Genetic algorithms; Hidden Markov models; Speech recognition; Vectors; Visualization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), 2012 Asia-Pacific

Conference_Location :

Hollywood, CA

Print_ISBN :

978-1-4673-4863-8

Type :

conf

Filename :

6411813

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=590666