  • DocumentCode
    417162
  • Title
    MPE-based discriminative linear transform for speaker adaptation
  • Author
    Wang, L.; Woodland, P.C.
  • Author_Institution
    Eng. Dept., Cambridge Univ., UK
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    We present a discriminative method for speaker adaptation, where the minimum phone error (MPE) criterion is used to estimate the discriminative linear transforms (DLTs), including both mean and diagonal variance transforms. The I-smoothing technique is essential to improve the generalization of DLTs. Experiments on supervised adaptation for non-native speakers on the North American Business (NAB) Spoke 3 task show that MPE-based DLT outperforms both MLLR and a previously proposed discriminative method for transform estimation. Preliminary experiments on unsupervised DLT estimation are also reported for conversational telephone speech transcription.
  • Keywords
    maximum likelihood estimation; natural languages; speech recognition; transforms; MPE; conversational telephone speech transcription; diagonal variance transform; discriminative linear transform; mean transform; minimum phone error; smoothing technique; speaker adaptation; transform estimation; Error correction; Hidden Markov models; Laboratories; Machine intelligence; Maximum likelihood linear regression; Parameter estimation; Speech; Statistics; Telephony
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type
    conf
  • DOI
    10.1109/ICASSP.2004.1325987
  • Filename
    1325987