DocumentCode :
394343
Title :
A phone recognizer helps to recognize words better
Author :
Stemmer, Georg ; Zeissler, Viktor ; Hacker, Christian ; Noth, Elmar ; Niemann, Heinrich
Author_Institution :
Lehrstuhl fur Mustererkennung (Informatik 5), Erlangen-Nurnberg Univ., Erlangen, Germany
Volume :
1
fYear :
2003
fDate :
6-10 April 2003
Abstract :
For most speech recognition systems dynamic features are the only way to incorporate temporal context into the output distributions of the HMM. In this paper we propose an efficient method to utilize a large context in the recognition process. State scores of a phone recognizer which runs in parallel to the word recognizer are computed. Integrating these scores in the HMM of the word recognizer makes their output densities context-dependent. The approach is evaluated on a set of spontaneous utterances which have been recorded with our spoken dialogue system. A significant reduction of the word error rate has been achieved.
Keywords :
error statistics; feature extraction; hidden Markov models; speech processing; speech recognition; HMM; context-dependent output densities; dynamic features; large context; output distributions; phone recognizer; speech recognition; spontaneous utterances; state scores; temporal context; word error rate; word recognizer; Cepstral analysis; Computer hacking; Concurrent computing; Data mining; Error analysis; Feature extraction; Hidden Markov models; Pattern recognition; Random variables; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7663-3
Type :
conf
DOI :
10.1109/ICASSP.2003.1198886
Filename :
1198886
Link To Document :
بازگشت