A phone recognizer helps to recognize words better

Author

Stemmer, Georg ; Zeissler, Viktor ; Hacker, Christian ; Noth, Elmar ; Niemann, Heinrich

Author_Institution

Lehrstuhl fur Mustererkennung (Informatik 5), Erlangen-Nurnberg Univ., Erlangen, Germany

Volume

1

fYear

2003

fDate

6-10 April 2003

Abstract

For most speech recognition systems dynamic features are the only way to incorporate temporal context into the output distributions of the HMM. In this paper we propose an efficient method to utilize a large context in the recognition process. State scores of a phone recognizer which runs in parallel to the word recognizer are computed. Integrating these scores in the HMM of the word recognizer makes their output densities context-dependent. The approach is evaluated on a set of spontaneous utterances which have been recorded with our spoken dialogue system. A significant reduction of the word error rate has been achieved.

Keywords

error statistics; feature extraction; hidden Markov models; speech processing; speech recognition; HMM; context-dependent output densities; dynamic features; large context; output distributions; phone recognizer; speech recognition; spontaneous utterances; state scores; temporal context; word error rate; word recognizer; Cepstral analysis; Computer hacking; Concurrent computing; Data mining; Error analysis; Feature extraction; Hidden Markov models; Pattern recognition; Random variables; Speech recognition;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on

ISSN

1520-6149

Print_ISBN

0-7803-7663-3

Type

conf

DOI

10.1109/ICASSP.2003.1198886

Filename

1198886