Integrating dynamic speech modalities into context decision trees

Author

Fügen, Christian ; Rogina, Ivica

Author_Institution

Interactive Syst. Lab., Karlsruhe Univ., Germany

Volume

3

fYear

2000

fDate

2000

Firstpage

1277

Abstract

Context decision trees are widely used in the speech recognition community. Besides questions about phonetic classes of a phone´s context, questions about their position within a word and questions about the gender of the current speaker have been used so far. In this paper we additionally incorporate questions about current modalities of the spoken utterance like the speaker´s dialect, the speaking rate, the signal to noise ratio, the latter two of which may change while speaking one utterance. We present a framework that treats all these modalities in a uniform way. Experiments with the Janus speech recognizer have produced error rate reductions of up to 10% when compared to systems that do not use modality questions

Keywords

decision trees; speech recognition; Janus speech recognizer; context decision trees; current modalities; dynamic speech modalities; error rate reductions; phone´s context; phonetic classes; signal to noise ratio; speaker gender; speaker´s dialect; speaking rate; speech recognition community; Context modeling; Decision trees; Decoding; Error analysis; Interactive systems; Signal to noise ratio; Speech recognition; Table lookup; Training data; Vocabulary;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on

Conference_Location

Istanbul

ISSN

1520-6149

Print_ISBN

0-7803-6293-4

Type

conf

DOI

10.1109/ICASSP.2000.861810

Filename

861810