DocumentCode :
2703017
Title :
Modeling Duration via Lattice Rescoring
Author :
Jennequin, N. ; Gauvain, J. -L.
Author_Institution :
Spoken Language Process. Group, LIMSI-CNRS, Orsay, France
Volume :
4
fYear :
2007
fDate :
15-20 April 2007
Abstract :
It is often acknowledged that HMMs do not properly model phone and word durations. In this paper phone and word duration models are used to improve the accuracy of state-of-the-art large vocabulary speech recognition systems. The duration information is integrated into the systems in a rescoring of word lattices that include phone-level segmentations. Experimental results are given for a conversational telephone speech (CTS) task in French and for the TC-Star EPPS transcription task in Spanish and English. An absolute word error rate reduction of about 0.5% is observed for the CTS task, and smaller but consistent gains are observed for the EPPS task in both languages.
Keywords :
hidden Markov models; speech processing; speech recognition; English; French; HMM; Spanish; conversational telephone speech; large vocabulary speech recognition systems; lattice rescoring; phone models; phone-level segmentations; word duration models; word error rate reduction; Decoding; Error analysis; Hidden Markov models; Lattices; Natural languages; Speech recognition; Telephony; Topology; Training data; Vocabulary; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.366994
Filename :
4218182
Link To Document :
بازگشت