• DocumentCode
    959901
  • Title
    Linear Spectral Transformation for Robust Speech Recognition Using Maximum Mutual Information
  • Author
    Kim, Donghyun ; Yook, Dongsuk
  • Author_Institution
    Korea Univ., Seoul
  • Volume
    14
  • Issue
    7
  • fYear
    2007
  • fDate
    7/1/2007
  • Firstpage
    496
  • Lastpage
    499
  • Abstract
    This paper presents a transformation-based rapid adaptation technique for robust speech recognition using a linear spectral transformation (LST) and a maximum mutual information (MMI) criterion. Previously, a maximum likelihood linear spectral transformation (ML-LST) algorithm was proposed for fast adaptation in unknown environments. Since the MMI estimation method does not require evenly distributed training data and increases the a posteriori probability of the word sequences of the training data, we combine the linear spectral transformation method and the MMI estimation technique in order to achieve extremely rapid adaptation using only one word of adaptation data. The proposed algorithm, called MMI-LST, was implemented using the extended Baum-Welch algorithm and phonetic lattices, and evaluated on the TIMIT and FFMTIMIT corpora. It provides a relative reduction in the speech recognition error rate of 11.1% using only 0.25 s of adaptation data.
  • Keywords
    estimation theory; spectral analysis; speech processing; speech recognition; MMI estimation technique; extended Baum-Welch algorithm; linear spectral transformation technique; maximum mutual information estimation technique; phonetic lattices; robust speech recognition; transformation-based rapid adaptation technique; Degradation; Error analysis; Lattices; Linear regression; Maximum likelihood estimation; Mutual information; Parameter estimation; Robustness; Speech recognition; Training data; Linear spectral transformation; maximum mutual information (MMI); rapid adaptation
  • fLanguage
    English
  • Journal_Title
    Signal Processing Letters, IEEE
  • Publisher
    IEEE
  • ISSN
    1070-9908
  • Type
    jour
  • DOI
    10.1109/LSP.2006.891337
  • Filename
    4244481