DocumentCode
394343
Title
A phone recognizer helps to recognize words better
Author
Stemmer, Georg ; Zeissler, Viktor ; Hacker, Christian ; Noth, Elmar ; Niemann, Heinrich
Author_Institution
Lehrstuhl fur Mustererkennung (Informatik 5), Erlangen-Nurnberg Univ., Erlangen, Germany
Volume
1
fYear
2003
fDate
6-10 April 2003
Abstract
For most speech recognition systems dynamic features are the only way to incorporate temporal context into the output distributions of the HMM. In this paper we propose an efficient method to utilize a large context in the recognition process. State scores of a phone recognizer which runs in parallel to the word recognizer are computed. Integrating these scores in the HMM of the word recognizer makes their output densities context-dependent. The approach is evaluated on a set of spontaneous utterances which have been recorded with our spoken dialogue system. A significant reduction of the word error rate has been achieved.
Keywords
error statistics; feature extraction; hidden Markov models; speech processing; speech recognition; HMM; context-dependent output densities; dynamic features; large context; output distributions; phone recognizer; speech recognition; spontaneous utterances; state scores; temporal context; word error rate; word recognizer; Cepstral analysis; Computer hacking; Concurrent computing; Data mining; Error analysis; Feature extraction; Hidden Markov models; Pattern recognition; Random variables; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1198886
Filename
1198886
Link To Document