• DocumentCode
    294544
  • Title

    Noisy speech recognition using robust inversion of hidden Markov models

  • Author

    Moon, Seokyong ; Hwang, Jeng-Neng

  • Author_Institution
    Dept. of Electr. Eng., Washington Univ., Seattle, WA, USA
  • Volume
    1
  • fYear
    1995
  • fDate
    9-12 May 1995
  • Firstpage
    145
  • Abstract
    The hidden Markov model (HMM) inversion algorithm is proposed and applied to robust speech recognition for general types of mismatched conditions. The Baum-Welch HMM inversion algorithm is a dual procedure to the Baum-Welch HMM reestimation algorithm, which is the most widely used speech recognition technique. The forward training of an HMM, based on the Baum-Welch reestimation, finds the model parameters λ that optimize some criterion, usually maximum likelihood (ML), with given speech inputs s. On the other hand, the inversion of a HMM finds speech inputs s that optimize some criterion with given model parameters λ. The performance of the proposed HMM inversion, in conjunction with HMM reestimation, for robust speech recognition under additive noise corruption and microphone mismatch conditions is favorably compared with other noisy speech recognition techniques, such as the projection-based first-order cepstrum normalization (FOCN) and the robust minimax (MINIMAX) classification techniques
  • Keywords
    hidden Markov models; interference (signal); inverse problems; maximum likelihood estimation; speech recognition; Baum-Welch HMM inversion algorithm; additive noise corruption; classification techniques; forward training; hidden Markov models; maximum likelihood; microphone mismatch; mismatched conditions; noisy speech recognition; projection-based first-order cepstrum normalization; reestimation algorithm; robust inversion; robust minimax; Additive noise; Additive white noise; Automatic speech recognition; Gaussian noise; Hidden Markov models; Microphones; Minimax techniques; Noise robustness; Speech enhancement; Speech recognition; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
  • Conference_Location
    Detroit, MI
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-2431-5
  • Type

    conf

  • DOI
    10.1109/ICASSP.1995.479385
  • Filename
    479385