Smoothed N-best-based speaker adaptation for speech recognition

Author

Matsui, Tomoko ; Matsuoka, Tatsuo ; Furui, Sadaoki

Author_Institution

NTT Human Interface Labs., Tokyo, Japan

Volume

2

fYear

1997

fDate

21-24 Apr 1997

Firstpage

1015

Abstract

Smoothed estimation and utterance verification are introduced into the N-best-based speaker adaptation method. That method is effective even for speakers whose decodings using speaker-independent (SI) models are error-prone, that is, for speakers for whom adaptation techniques are truly needed. The smoothed estimation improves the performance for such speakers, and the utterance verification reduces the required amount of calculation. Performance evaluation using connected-digit (four-digit strings) recognition experiments performed over actual telephone lines showed a reduction of 36.4% in the error rates for speakers whose decodings using SI models are error-prone. To try and find an effective model-transformation for speaker adaptation, we discuss replacing mixture-mean bias estimation by the widely used mixture-mean linear-regression-matrix estimation

Keywords

decoding; error statistics; hidden Markov models; matrix algebra; smoothing methods; speaker recognition; speech processing; statistical analysis; adaptation techniques; connected digit recognition experiments; continuous mixture density HMM; decoding; error rate reduction; mixture mean linear regression matrix estimation; model transformation; performance evaluation; smoothed N-best-based speaker adaptation; smoothed estimation; speaker independent models; speech recognition; telephone lines; utterance verification; Adaptation model; Decoding; Equations; Error analysis; Hidden Markov models; Humans; Laboratories; Performance evaluation; Speech recognition; Telephony;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on

Conference_Location

Munich

ISSN

1520-6149

Print_ISBN

0-8186-7919-0

Type

conf

DOI

10.1109/ICASSP.1997.596112

Filename

596112