• DocumentCode
    312134
  • Title

    Bayesian adaptation of speech recognizers to field speech data

  • Author

    Miglietta, C.G. ; Mokbel, Chajic ; Jouvet, Denis ; Monne, J.

  • Author_Institution
    CNET, Lannion, France
  • Volume
    2
  • fYear
    1996
  • fDate
    3-6 Oct 1996
  • Firstpage
    917
  • Abstract
    The article studies a Bayesian (or Maximum A Posteriori MAP) approach to the adaptation of continuous density hidden Markov models (CDHMMs) to a specific condition of a speech recognition application. In order to improve the model robustness, CDHMMs formerly trained from laboratory data are then adapted using context dependent field utterances. Two specific problems have to be faced when using the MAP approach: the estimation of the a priori distribution parameters and the lack of field adaptation data for some distributions of the CDHMM. To estimate the a priori distribution parameters, we need to identify different realizations of the model parameters. Three different solutions are proposed and evaluated. To overcome the lack of adaptation data, field acoustical training frames may be shared among similar distributions. This is performed using an acoustical tree, obtained by progressively clustering the model distributions. Recognition results show that MAP adapted models significantly outperform those trained by maximum likelihood (ML), specifically when the field data set is small
  • Keywords
    Bayes methods; acoustic signal processing; hidden Markov models; maximum likelihood estimation; speech recognition; Bayesian adaptation; CDHMMs; MAP adapted models; MAP approach; Maximum A Posteriori MAP; a priori distribution parameters; acoustical tree; context dependent field utterances; continuous density hidden Markov models; field acoustical training frames; field adaptation data; field data set; field speech data; maximum likelihood; model distributions; model parameters; model robustness; speech recognition application; speech recognizers; Adaptation model; Bayesian methods; Covariance matrix; Databases; Equations; Iterative algorithms; Laboratories; Random variables; Robustness; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    0-7803-3555-4
  • Type

    conf

  • DOI
    10.1109/ICSLP.1996.607751
  • Filename
    607751