• DocumentCode
    2875203
  • Title

    Selective EM training of acoustic models based on sufficient statistics of single utterances

  • Author

    Cincarek, Tobias ; Toda, Tomoki ; Saruwatari, Hiroshi ; Shikano, Kiyohiro

  • Author_Institution
    Graduate Sch. of Inf. Sci., Nara Inst. of Sci. & Technol.
  • fYear
    2005
  • fDate
    27-27 Nov. 2005
  • Firstpage
    168
  • Lastpage
    173
  • Abstract
    In this paper, a new algorithm for selective training of acoustic models is proposed. The algorithm is formulated for an HMM-based model with Gaussian mixture densities, but works in principle for any statistical model, which has sufficient statistics. Since there are too many possibilities for selecting a data subset from a larger database, a heuristic has to be employed. The algorithm is based on deleting single utterances from a data pool temporarily or alternating between successive deletion or addition of utterances. The optimization criterion is the likelihood of the new model parameters given some development data, which can be calculated in a short amount of time based on sufficient statistics. The method is applied to automatically obtain task-dependent acoustic models for infant and elderly speech by selecting utterances from a data pool which are acoustically close to the development data. The proposed method is computationally practical and also addresses the issue of reducing the high costs evolving from the development of applications which make use of speech recognition technology
  • Keywords
    Gaussian processes; acoustic signal processing; expectation-maximisation algorithm; hidden Markov models; speech recognition; speech synthesis; EM training; Gaussian mixture densities; HMM; elderly speech; infant speech; single utterances; speech recognition; task-dependent acoustic models; Acoustic applications; Costs; Databases; Hidden Markov models; Information science; Senior citizens; Speech processing; Speech recognition; Statistics; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 2005 IEEE Workshop on
  • Conference_Location
    San Juan
  • Print_ISBN
    0-7803-9478-X
  • Electronic_ISBN
    0-7803-9479-8
  • Type

    conf

  • DOI
    10.1109/ASRU.2005.1566486
  • Filename
    1566486