• DocumentCode
    66944
  • Title

    Joint Uncertainty Decoding for Noise Robust Subspace Gaussian Mixture Models

  • Author

    Lu, Liang ; Chin, Ken K. ; Ghoshal, Arnab ; Renals, Steve

  • Author_Institution
    Centre for Speech Technology Research, University of Edinburgh, Edinburgh, UK
  • Volume
    21
  • Issue
    9
  • fYear
    2013
  • fDate
    Sept. 2013
  • Firstpage
    1791
  • Lastpage
    1804
  • Abstract
    Joint uncertainty decoding (JUD) is a model-based noise compensation technique for conventional Gaussian mixture model (GMM) based speech recognition systems. Unlike vector Taylor series (VTS) compensation, which operates on the individual Gaussian components in an acoustic model, JUD clusters the Gaussian components into a smaller number of classes, sharing the compensation parameters for the set of Gaussians in a given class. This significantly reduces the computational cost. In this paper, we investigate noise compensation for subspace Gaussian mixture model (SGMM) based speech recognition systems using JUD. The total number of Gaussian components in an SGMM is typically very large. Therefore, direct compensation of the individual Gaussian components, as performed by VTS, is computationally expensive. In this paper, we show that JUD-based noise compensation can be successfully applied to SGMMs in a computationally efficient way. We evaluate the JUD/SGMM technique on the standard Aurora 4 corpus. Our experimental results indicate that the JUD/SGMM system yields lower word error rates than a conventional GMM system with either VTS-based or JUD-based noise compensation.
  • Keywords
    Gaussian noise; Gaussian processes; compensation; decoding; speech coding; speech recognition; GMM based speech recognition systems; Gaussian components; JUD-SGMM technique; VTS compensation; acoustic model; compensation parameters; joint uncertainty decoding; model-based noise compensation technique; noise robust subspace Gaussian mixture models; standard Aurora 4 corpus; vector Taylor series; word error rates; Adaptation models; Computational modeling; Hidden Markov models; Noise; Speech; Speech recognition; Vectors; Aurora 4; joint uncertainty decoding; noise robust ASR; subspace Gaussian mixture model; vector Taylor series;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2013.2248718
  • Filename
    6469175