DocumentCode :
3496733
Title :
Selecting relevant visual features for speechreading
Author :
Estellers, V. ; Gurban, M. ; Thiran, J.-P.
Author_Institution :
Signal Process. Lab. 5, Ecole Polytech. Fed. de Lausanne, Lausanne, Switzerland
fYear :
2009
fDate :
7-10 Nov. 2009
Firstpage :
1433
Lastpage :
1436
Abstract :
A quantitative measure of relevance is proposed for the task of constructing visual feature sets which are at the same time relevant and compact. A feature´s relevance is given by the amount of information that it contains about the problem, while compactness is achieved by preventing the replication of information between features. To achieve these goals, we use mutual information both for assessing relevance and measuring the redundancy between features. Our application is speechreading, that is, speech recognition performed on the video of the speaker. This is justified by the fact that the performance of audio speech recognition can be improved by augmenting the audio features with visual ones, especially when there is noise in the audio channel. We report significant improvements compared to the most common method of dimensionality reduction for speechreading, Linear Discriminant Analysis (LDA).
Keywords :
audio signal processing; speech recognition; audio channel noise; audio features; audio speech recognition; information replication; linear discriminant analysis; mutual information; quantitative measure; selecting relevant visual features; speechreading; time compact; time relevant; Active shape model; Data mining; Discrete cosine transforms; Feature extraction; Image color analysis; Information analysis; Linear discriminant analysis; Mouth; Mutual information; Speech recognition; Feature extraction; image processing; speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Image Processing (ICIP), 2009 16th IEEE International Conference on
Conference_Location :
Cairo
ISSN :
1522-4880
Print_ISBN :
978-1-4244-5653-6
Electronic_ISBN :
1522-4880
Type :
conf
DOI :
10.1109/ICIP.2009.5414563
Filename :
5414563
Link To Document :
بازگشت