• DocumentCode
    3530396
  • Title

    Robust speech recognition based on structured modeling, irrelevant variability normalization and unsupervised online adaptation

  • Author

    Huo, Qiang ; Zhu, Donglai

  • Author_Institution
    Microsoft Res. Asia, Beijing
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    4637
  • Lastpage
    4640
  • Abstract
    We present a new approach to robust speech recognition based on structured modeling, irrelevant variability normalization (IVN) and unsupervised online adaptation (OLA). In offline training stage, a set of generic HMMs for basic speech units relevant to phonetic classification is trained along with several sets of feature transforms with different degrees of freedom by using a maximum likelihood (ML) IVN-based training strategy. In recognition stage, after a first-pass recognition, the most appropriate set of feature transforms is identified and adapted under ML criterion by using the unknown utterance itself, which is recognized again to achieve better performance by using the adapted feature transforms and the pre-trained generic HMMs. The effectiveness of the proposed approach is confirmed by evaluation experiments on Finnish Aurora3 database.
  • Keywords
    hidden Markov models; maximum likelihood estimation; speech processing; speech recognition; Finnish Aurora3 database; IVN-based training strategy; feature transforms; generic HMM; irrelevant variability normalization; maximum likelihood; phonetic classification; robust speech recognition; structured modeling; unsupervised online adaptation; Asia; Automatic speech recognition; Decoding; Gaussian processes; Hidden Markov models; Labeling; Robustness; Spatial databases; Speech recognition; Training data; feature transformation; irrelevant variability normalization; online adaptation; robust speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4960664
  • Filename
    4960664