• DocumentCode
    3165686
  • Title

    A model structure integration based on a Bayesian framework for speech recognition

  • Author

    Shiota, Sayaka ; Hashimoto, Kei ; Nankaku, Yoshihiko ; Tokuda, Keiichi

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Nagoya Inst. of Technol., Nagoya, Japan
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    4813
  • Lastpage
    4816
  • Abstract
    This paper proposes an acoustic modeling technique based on Bayesian framework using multiple model structures for speech recognition. The Bayesian approach is a statistical technique for estimating reliable predictive distributions by marginalizing model parameters, and its effectiveness in HMM-based speech recognition has been reported. Although the basic idea underlying the Bayesian approach is to treat all parameters as random variables, only one model structure is still selected in the conventional method. Multiple model structures are treated as latent variables in the proposed method and integrated based on the Bayesian framework. Furthermore, we applied deterministic annealing to the training algorithm to estimate appropriate acoustic models. The proposed method effectively utilizes multiple model structures, especially in the early stage of training and this leads to better predictive distributions and improvement of recognition performance.
  • Keywords
    Bayes methods; hidden Markov models; speech recognition; Bayesian framework; HMM-based speech recognition; acoustic modeling technique; deterministic annealing; hidden Markov model; latent variables; marginalizing model parameters; multiple model structures; predictive distributions; random variables; statistical technique; training algorithm; Annealing; Bayesian methods; Hidden Markov models; Periodic structures; Predictive models; Speech recognition; Training; Bayesian methods; Deterministic annealing; Hidden Markov model; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6288996
  • Filename
    6288996