Title :
Discrimination comparison between audio and visual features
Author :
Sui, Chao ; Togneri, Roberto ; Haque, Showera ; Bennamoun, Mohammed
Author_Institution :
Sch. of Comput. Sci. & Software Eng., Univ. of Western Australia, Perth, WA, Australia
Abstract :
This paper compares the discriminative power of audio, 2D visual and 3D visual features for speech recognition. The audio and visual feature extraction schemes and several feature selection techniques are described first. Using these extraction and selection methods, several experiments are conducted to compare the discrimination of the audio features, the 2D visual features and the 3D visual features on an hVd word classification task. The 3D visual features are found to be more separable than the 2D visual features, which suggests that 3D-based audio-visual speech recognition may achieve better results than its traditional 2D-based counterpart.
Keywords :
feature extraction; speech recognition; 2D-based visual feature extraction scheme; 3D-based audio-visual speech recognition; 3D-based visual feature extraction scheme; audio feature extraction scheme; feature selection technique; hVd words classification task;
Conference_Title :
Signals, Systems and Computers (ASILOMAR), 2012 Conference Record of the Forty Sixth Asilomar Conference on
Conference_Location :
Pacific Grove, CA
Print_ISBN :
978-1-4673-5050-1
DOI :
10.1109/ACSSC.2012.6489302