Title :
The 1998 HTK system for transcription of conversational telephone speech
Author :
Hain, T. ; Woodland, P.C. ; Niesler, T.R. ; Whittaker, E.W.D.
Author_Institution :
Dept. of Eng., Cambridge Univ., UK
Abstract :
This paper describes the 1998 HTK large vocabulary speech recognition system for conversational telephone speech as used in the NIST 1998 Hub5E evaluation. Front-end and language modelling experiments conducted using various training and test sets from both the Switchboard and Callhome English corpora are presented. Our complete system includes reduced bandwidth analysis, side-based cepstral feature normalisation, vocal tract length normalisation (VTLN), triphone and quinphone hidden Markov models (HMMs) built using speaker adaptive training (SAT), maximum likelihood linear regression (MLLR) speaker adaptation and a confidence score based system combination. A detailed description of the complete system together with experimental results for each stage of our multi-pass decoding scheme is presented. The word error rate obtained is almost 20% better than our 1997 system on the development set
Keywords :
adaptive systems; cepstral analysis; feature extraction; hidden Markov models; maximum likelihood estimation; natural languages; speech recognition; telephony; 1998 HTK system; Callhome English corpora; NIST 1998 Hub5E evaluation; Switchboard English corpora; confidence score; conversational telephone speech; experimental results; front-end experiment; hidden Markov models; language modelling experiment; large vocabulary speech recognition system; maximum likelihood linear regression; multi-pass decoding; quinphone HMM; reduced bandwidth analysis; side-based cepstral feature normalisation; speaker adaptation; speaker adaptive training; test set; training set; transcription; triphone HMM; vocal tract length normalisation; word error rate; Cepstral analysis; Hidden Markov models; Maximum likelihood linear regression; NIST; Natural languages; Speech analysis; Speech recognition; Telephony; Testing; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
Print_ISBN :
0-7803-5041-3
DOI :
10.1109/ICASSP.1999.758061