• DocumentCode
    2768602
  • Title

    High-performance hmm adaptation with joint compensation of additive and convolutive distortions via Vector Taylor Series

  • Author

    Li, Jinyu ; Deng, Li ; Yu, Dong ; Gong, Yifan ; Acero, Alex

  • Author_Institution
    Microsoft Corp., Redmond
  • fYear
    2007
  • fDate
    9-13 Dec. 2007
  • Firstpage
    65
  • Lastpage
    70
  • Abstract
    In this paper, we present our recent development of a model-domain environment-robust adaptation algorithm, which demonstrates high performance in the standard Aurora 2 speech recognition task. The algorithm consists of two main steps. First, the noise and channel parameters are estimated using a nonlinear environment distortion model in the cepstral domain, the speech recognizer\´s "feedback" information, and the vector-Taylor-series (VTS) linearization technique collectively. Second, the estimated noise and channel parameters are used to adapt the static and dynamic portions of the HMM means and variances. This two-step algorithm enables joint compensation of both additive and convolutive distortions (JAC). In the experimental evaluation using the standard Aurora 2 task, the proposed JAC/VTS algorithm achieves 91.11% accuracy using the clean-trained simple HMM backend as the baseline system for the model adaptation. This represents high recognition performance on this task without discriminative training of the HMM system. Detailed analysis on the experimental results shows that adaptation of the dynamic portion of the HMM mean and variance parameters is critical to the success of our algorithm.
  • Keywords
    distortion; hidden Markov models; linearisation techniques; speech recognition; telecommunication channels; Aurora 2 speech recognition task; additive distortion; channel parameter; convolutive distortion; high-performance HMM adaptation; linearization technique; model-domain environment-robust adaptation algorithm; nonlinear environment distortion model; speech recognizer feedback information; vector Taylor series; Adaptation model; Cepstral analysis; Hidden Markov models; Nonlinear distortion; Parameter estimation; Speech enhancement; Speech recognition; Standards development; Taylor series; Working environment noise; additive and convolutive distortions; joint compensation; robust ASR; vector Taylor series;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
  • Conference_Location
    Kyoto
  • Print_ISBN
    978-1-4244-1746-9
  • Electronic_ISBN
    978-1-4244-1746-9
  • Type

    conf

  • DOI
    10.1109/ASRU.2007.4430085
  • Filename
    4430085