DocumentCode :
3646034
Title :
Convolutive Bottleneck Network features for LVCSR
Author :
Karel Veselý;Martin Karafiát;František Grézl
Author_Institution :
Speech@FIT, Brno University of Technology, Bož
fYear :
2011
Firstpage :
42
Lastpage :
47
Abstract :
In this paper, we focus on improvements of the bottleneck ANN in a Tandem LVCSR system. First, the influence of training set size and the ANN size is evaluated. Second, a very positive effect of linear bottleneck is shown. Finally a Convolutive Bottleneck Network is proposed as extension of the current state-of-the-art Universal Context Network. The proposed training method leads to 5.5% relative reduction of WER, compared to the Universal Context ANN baseline. The relative improvement compared to the 5-layer single-bottleneck network is 17.7%. The dataset ctstrain07 composed of more than 2000 hours of English Conversational Telephone Speech was used for the experiments. The TNet toolkit with CUDA GPGPU implementation was used for fast training.
Keywords :
"Artificial neural networks","Context","Training","Hidden Markov models","Accuracy","Topology","Training data"
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
Print_ISBN :
978-1-4673-0365-1
Type :
conf
DOI :
10.1109/ASRU.2011.6163903
Filename :
6163903
Link To Document :
بازگشت