• DocumentCode
    1617191
  • Title

    Hybrid speech recognition system with discriminative training applied for Romanian language

  • Author

    Gavat, Inge ; Zirra, Matei ; Cula, Oana

  • Author_Institution
    Polytech.. Univ. of Bucharest, Romania
  • Volume
    1
  • fYear
    1998
  • Firstpage
    11
  • Abstract
    This paper describes a hybrid connectionist-statistical system consisting of a neural network integrated in a hidden Markov model (HMM). The neural network used is the multilayer perceptron (MLP) and that network is the mechanism that computes the a-posteriori probability of a sequence of HMMs states. The classifier is based on total scores computed by Viterbi alignment for each hybrid model corresponding to the words in the vocabulary. Because of the lack of discrimination between the models and the unintended discrimination between the states in each model, we propose a solution that improves the system, namely an additional training task based on a cost function that approximates the misclassification rate of the hybrid system. The optimization criterion is based on a descent algorithm and the result is a minimum classification error. Our experiments on a 35 word vocabulary, show an improvement of the recognition rate from 92.4% for the case of a statistical system based only on HMMs, to 94.7% for the case of a hybrid HMM-MLP system, and to 97.9% for the case of an improved hybrid system with an extra discriminative training
  • Keywords
    hidden Markov models; learning (artificial intelligence); multilayer perceptrons; natural languages; probability; signal classification; speech recognition; statistical analysis; Romanian language; Viterbi alignment; a-posteriori probability; cost function; descent algorithm; discriminative training; experiments; hidden Markov model; hybrid HMM-MLP system; hybrid connectionist-statistical system; hybrid speech recognition system; minimum classification error; misclassification rate approximation; multilayer perceptron; recognition rate; speech processing; training task; vocabulary; Artificial neural networks; Backpropagation algorithms; Hidden Markov models; Natural languages; Neural networks; Power system modeling; Speech processing; Speech recognition; Viterbi algorithm; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrotechnical Conference, 1998. MELECON 98., 9th Mediterranean
  • Conference_Location
    Tel-Aviv
  • Print_ISBN
    0-7803-3879-0
  • Type

    conf

  • DOI
    10.1109/MELCON.1998.692163
  • Filename
    692163