• DocumentCode
    675465
  • Title

    A comparison of several variants of GMM on speech recognition task

  • Author

    Jakovljevic, Niksa M. ; Miskovic, Dragisa ; Pakoci, Edvin ; Grbic, Tatjana ; Delic, Vlado

  • fYear
    2013
  • fDate
    26-28 Nov. 2013
  • Firstpage
    466
  • Lastpage
    469
  • Abstract
    In the automatic speech recognition task, the dominant approach is the statistical framework based on hidden Markov models in combination with Gaussian mixture models. The issues which should be solved are: how to obtain a statistically efficient estimation of model parameters, especially covariance matrix, whose number of parameters is proportional to the square of the dimensionality of the feature space, as well as sufficiently fast and accurate evaluation of observation emission probabilities. This paper provides the evaluation results of several models (diagonal approximation, maximum likelihood linear transformation, semi tied covariance) tested on Serbian SpeechDat corpus. In these experiments, the model with full covariance matrices has achieved the best performance (in terms of accuracy and computation time).
  • Keywords
    Gaussian processes; approximation theory; covariance matrices; hidden Markov models; maximum likelihood estimation; mixture models; speech recognition; GMM; Gaussian mixture models; Serbian SpeechDat corpus; automatic speech recognition task; diagonal approximation; feature dimensionality; full covariance matrix; hidden Markov models; maximum likelihood linear transformation; model parameter estimation; observation emission probabilities; semi tied covariance; statistical framework; Automatic speech recognition; Biological system modeling; Covariance matrices; Electronic mail; Gaussian mixture model; Hidden Markov models; Speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Telecommunications Forum (TELFOR), 2013 21st
  • Conference_Location
    Belgrade
  • Print_ISBN
    978-1-4799-1419-7
  • Type

    conf

  • DOI
    10.1109/TELFOR.2013.6716268
  • Filename
    6716268