• DocumentCode
    3131914
  • Title

    Improving large vocabulary continuous speech recognition by combining GMM-based and reservoir-based acoustic modeling

  • Author

    Triefenbach, Fabian ; Demuynck, Kris ; Martens, J.

  • Author_Institution
    ELIS Multimedia Lab., Ghent Univ. - IBBT, Ghent, Belgium
  • fYear
    2012
  • fDate
    2-5 Dec. 2012
  • Firstpage
    107
  • Lastpage
    112
  • Abstract
    In earlier work we have shown that good phoneme recognition is possible with a so-called reservoir, a special type of recurrent neural network. In this paper, different architectures based on Reservoir Computing (RC) for large vocabulary continuous speech recognition are investigated. Besides experiments with HMM hybrids, it is shown that a RC-HMM tandem can achieve the same recognition accuracy as a classical HMM, which is a promising result for such a fairly new paradigm. It is also demonstrated that a state-level combination of the scores of the tandem and the baseline HMM leads to a significant improvement over the baseline. A word error rate reduction of the order of 20% relative is possible.
  • Keywords
    Gaussian processes; acoustic signal processing; hidden Markov models; speech recognition; vocabulary; GMM; HMM; RC; phoneme recognition; recurrent neural network; reservoir computing; tandem acoustic modeling; vocabulary continuous speech recognition; Acoustics; Computational modeling; Hidden Markov models; Reservoirs; Speech recognition; Training; Vectors; continuous speech recognition; reservoir computing; tandem acoustic modeling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language Technology Workshop (SLT), 2012 IEEE
  • Conference_Location
    Miami, FL
  • Print_ISBN
    978-1-4673-5125-6
  • Electronic_ISBN
    978-1-4673-5124-9
  • Type

    conf

  • DOI
    10.1109/SLT.2012.6424206
  • Filename
    6424206