DocumentCode :
1804187
Title :
Discrimination comparison between audio and visual features
Author :
Chao Sui ; Togneri, Roberto ; Haque, Showera ; Bennamoun, Mohammed
Author_Institution :
Sch. of Comput. Sci. & Software Eng., Univ. of Western Australia, Perth, WA, Australia
fYear :
2012
fDate :
4-7 Nov. 2012
Firstpage :
1609
Lastpage :
1612
Abstract :
This paper compares the discriminative power of audio, 2D-based visual, and 3D-based visual features for speech recognition. The audio and visual feature extraction schemes and several feature selection techniques are described first. Using these extraction and selection methods, several experiments are conducted to compare the discrimination of the audio features, the 2D visual features, and the 3D visual features on the hVd words classification task. The study finds that the 3D visual features are more separable than the 2D visual features, so 3D-based audio-visual speech recognition may achieve better results than its traditional 2D-based counterpart.
Keywords :
feature extraction; speech recognition; 2D-based visual feature extraction scheme; 3D-based audio-visual speech recognition; 3D-based visual feature extraction scheme; audio feature extraction scheme; feature selection technique; hVd words classification task
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Signals, Systems and Computers (ASILOMAR), 2012 Conference Record of the Forty Sixth Asilomar Conference on
Conference_Location :
Pacific Grove, CA
ISSN :
1058-6393
Print_ISBN :
978-1-4673-5050-1
Type :
conf
DOI :
10.1109/ACSSC.2012.6489302
Filename :
6489302