DocumentCode
3131914
Title
Improving large vocabulary continuous speech recognition by combining GMM-based and reservoir-based acoustic modeling
Author
Triefenbach, Fabian ; Demuynck, Kris ; Martens, J.
Author_Institution
ELIS Multimedia Lab., Ghent Univ. - IBBT, Ghent, Belgium
fYear
2012
fDate
2-5 Dec. 2012
Firstpage
107
Lastpage
112
Abstract
In earlier work we have shown that good phoneme recognition is possible with a so-called reservoir, a special type of recurrent neural network. In this paper, different architectures based on Reservoir Computing (RC) for large vocabulary continuous speech recognition are investigated. Besides experiments with HMM hybrids, it is shown that a RC-HMM tandem can achieve the same recognition accuracy as a classical HMM, which is a promising result for such a fairly new paradigm. It is also demonstrated that a state-level combination of the scores of the tandem and the baseline HMM leads to a significant improvement over the baseline. A word error rate reduction of the order of 20% relative is possible.
Keywords
Gaussian processes; acoustic signal processing; hidden Markov models; speech recognition; vocabulary; GMM; HMM; RC; phoneme recognition; recurrent neural network; reservoir computing; tandem acoustic modeling; vocabulary continuous speech recognition; Acoustics; Computational modeling; Hidden Markov models; Reservoirs; Speech recognition; Training; Vectors; continuous speech recognition; reservoir computing; tandem acoustic modeling;
fLanguage
English
Publisher
ieee
Conference_Titel
Spoken Language Technology Workshop (SLT), 2012 IEEE
Conference_Location
Miami, FL
Print_ISBN
978-1-4673-5125-6
Electronic_ISBN
978-1-4673-5124-9
Type
conf
DOI
10.1109/SLT.2012.6424206
Filename
6424206
Link To Document