DocumentCode :
3131914
Title :
Improving large vocabulary continuous speech recognition by combining GMM-based and reservoir-based acoustic modeling
Author :
Triefenbach, Fabian ; Demuynck, Kris ; Martens, J.
Author_Institution :
ELIS Multimedia Lab., Ghent Univ. - IBBT, Ghent, Belgium
fYear :
2012
fDate :
2-5 Dec. 2012
Firstpage :
107
Lastpage :
112
Abstract :
In earlier work we have shown that good phoneme recognition is possible with a so-called reservoir, a special type of recurrent neural network. In this paper, different architectures based on Reservoir Computing (RC) for large vocabulary continuous speech recognition are investigated. Besides experiments with HMM hybrids, it is shown that a RC-HMM tandem can achieve the same recognition accuracy as a classical HMM, which is a promising result for such a fairly new paradigm. It is also demonstrated that a state-level combination of the scores of the tandem and the baseline HMM leads to a significant improvement over the baseline. A word error rate reduction of the order of 20% relative is possible.
Keywords :
Gaussian processes; acoustic signal processing; hidden Markov models; speech recognition; vocabulary; GMM; HMM; RC; phoneme recognition; recurrent neural network; reservoir computing; tandem acoustic modeling; vocabulary continuous speech recognition; Acoustics; Computational modeling; Hidden Markov models; Reservoirs; Speech recognition; Training; Vectors; continuous speech recognition; reservoir computing; tandem acoustic modeling;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language Technology Workshop (SLT), 2012 IEEE
Conference_Location :
Miami, FL
Print_ISBN :
978-1-4673-5125-6
Electronic_ISBN :
978-1-4673-5124-9
Type :
conf
DOI :
10.1109/SLT.2012.6424206
Filename :
6424206
Link To Document :
بازگشت