DocumentCode
310508
Title
Broadcast news transcription using HTK
Author
Woodland, P.C. ; Gales, M.J.F. ; Pye, D. ; Young, S.J.
Author_Institution
Dept. of Eng., Cambridge Univ., UK
Volume
2
fYear
1997
fDate
21-24 Apr 1997
Firstpage
719
Abstract
This paper examines the issues in extending a large vocabulary speech recognition system designed for clean and noisy read speech tasks to handle broadcast news transcription. Results using the 1995 DARPA H4 evaluation data set are presented for different front-end analyses and use of unsupervised model adaptation using maximum likelihood linear regression (MLLR). The HTK system for the 1996 H4 evaluation is then described. It includes a number of new features over previous HTK large vocabulary systems including decoder-guided segmentation, segment clustering, cache-based language modelling, and combined MAP and MLLR adaptation. The system runs in multiple passes through the data and the detailed results of each pass are given
Keywords
maximum likelihood estimation; radio broadcasting; speech recognition; television broadcasting; 1995 DARPA H4 evaluation data set; 1996 H4 evaluation; HTK system; broadcast news transcription; cache-based language modelling; combined MAP-MLLR adaptation; decoder-guided segmentation; front-end analyses; large vocabulary speech recognition system; maximum a priori estimation; maximum likelihood linear regression; segment clustering; unsupervised model adaptation; Background noise; Bandwidth; Broadcasting; Hidden Markov models; Maximum likelihood linear regression; Microphones; Robustness; Speech enhancement; Speech recognition; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location
Munich
ISSN
1520-6149
Print_ISBN
0-8186-7919-0
Type
conf
DOI
10.1109/ICASSP.1997.596005
Filename
596005
Link To Document