DocumentCode
730266
Title
Extracting deep bottleneck features for visual speech recognition
Author
Chao Sui ; Togneri, Roberto ; Bennamoun, Mohammed
Author_Institution
Sch. of Comput. Sci. & Software Eng., Univ. of Western Australia, Crawley, WA, Australia
fYear
2015
fDate
19-24 April 2015
Firstpage
1518
Lastpage
1522
Abstract
Motivated by the recent progresses in the use of deep learning techniques for acoustic speech recognition, we present in this paper a visual deep bottleneck feature (DBNF) learning scheme using a stacked auto-encoder combined with other techniques. Experimental results show that our proposed deep feature learning scheme yields approximately 24% relative improvement for visual speech accuracy. To the best of our knowledge, this is the first study which uses deep bottleneck feature on visual speech recognition. Our work firstly shows that the deep bottleneck visual feature is able to achieve a significant accuracy improvement on visual speech recognition.
Keywords
speech recognition; deep bottleneck features; stacked auto-encoder; visual speech accuracy; visual speech recognition; Accuracy; Discrete cosine transforms; Feature extraction; Hidden Markov models; Speech; Speech recognition; Visualization; Visual speech recognition; deep bottleneck feature; stacked denoising auto-encoder;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location
South Brisbane, QLD
Type
conf
DOI
10.1109/ICASSP.2015.7178224
Filename
7178224
Link To Document