DocumentCode
177689
Title
Lip Reading Using Wavelet-Based Features and Random Forests Classification
Author
Terissi, L.D. ; Parodi, M. ; Gomez, J.C.
Author_Institution
Lab. for Syst. Dynamics & Signal Process., Univ. Nac. de Rosario, Rosario, Argentina
fYear
2014
fDate
24-28 Aug. 2014
Firstpage
791
Lastpage
796
Abstract
In this paper, a visual speech classification scheme based on wavelets and Random Forests (RF) is proposed. Wavelet multiresolution analysis is used to model the sequence of visual parameters, represented by either model-based or image-based features. The coefficients associated with these representations are used as features to model the visual information. Lip reading is then performed using these wavelet-based features and a Random Forests classification method. The performance of the proposed visual speech classification scheme is evaluated with three different isolated word audio-visual databases, two of them public ones and the other compiled by the authors of this paper. Experimental results show that a good performance is achieved with the proposed lip reading system over the three databases. In addition, the proposed method performs better than other reported methods in the literature over the two public databases. Experiments over the three different databases were performed using the same configuration, i.e., there was no need to adapt the wavelet decomposition parameters or the RF classifier parameters to each particular database.
Keywords
audio databases; image classification; image resolution; speech recognition; visual databases; wavelet transforms; Random Forests classification method; audio-visual databases; lip reading system; visual speech classification scheme; wavelet decomposition parameters; wavelet-based features; Hidden Markov models; Mouth; Speech; Speech recognition; Visual databases; Visualization;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition (ICPR), 2014 22nd International Conference on
Conference_Location
Stockholm
ISSN
1051-4651
Type
conf
DOI
10.1109/ICPR.2014.146
Filename
6976856
Link To Document