• DocumentCode
    310573
  • Title

    Speaker adaptation in the Philips system for large vocabulary continuous speech recognition

  • Author

    Thelen, Eric ; Aubert, Xavier ; Beyerlein, Peter

  • Author_Institution
    Philips Res. Labs. GmbH, Aachen, Germany
  • Volume
    2
  • fYear
    1997
  • fDate
    21-24 Apr 1997
  • Firstpage
    1035
  • Abstract
    The combination of maximum likelihood linear regression (MLLR) with maximum a posteriori (MAP) adaptation has been investigated for both the enrollment of a new speaker as well as for the asymptotic recognition rate after several hours of dictation. We show that a least mean square approach to MLLR is quite effective in conjunction with phonetically derived regression classes. Results are presented for both ARPA read-speech test sets and real-life dictation. Significant improvements are reported. While MLLR achieves a faster adaptation rate when only few data is available, MAP has desirable asymptotic properties and the combination of both methods provides the best results. Both incremental and iterative batch modes are studied and compared to the performance of speaker-dependent training
  • Keywords
    dictation; iterative methods; least mean squares methods; maximum likelihood estimation; speech recognition; ARPA read-speech test sets; MAP; Philips system; asymptotic recognition rate; incremental batch modes; iterative batch modes; large vocabulary continuous speech recognition; least mean square approach; maximum a posteriori adaptation; maximum likelihood linear regression; phonetically derived regression classes; real-life dictation; speaker adaptation; speaker-dependent training; Bayesian methods; Error analysis; Laboratories; Loudspeakers; Maximum likelihood estimation; Maximum likelihood linear regression; Speech recognition; Testing; Vectors; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
  • Conference_Location
    Munich
  • ISSN
    1520-6149
  • Print_ISBN
    0-8186-7919-0
  • Type

    conf

  • DOI
    10.1109/ICASSP.1997.596117
  • Filename
    596117