DocumentCode
353511
Title
Integrating dynamic speech modalities into context decision trees
Author
Fügen, Christian ; Rogina, Ivica
Author_Institution
Interactive Syst. Lab., Karlsruhe Univ., Germany
Volume
3
fYear
2000
fDate
2000
Firstpage
1277
Abstract
Context decision trees are widely used in the speech recognition community. Besides questions about phonetic classes of a phone´s context, questions about their position within a word and questions about the gender of the current speaker have been used so far. In this paper we additionally incorporate questions about current modalities of the spoken utterance like the speaker´s dialect, the speaking rate, the signal to noise ratio, the latter two of which may change while speaking one utterance. We present a framework that treats all these modalities in a uniform way. Experiments with the Janus speech recognizer have produced error rate reductions of up to 10% when compared to systems that do not use modality questions
Keywords
decision trees; speech recognition; Janus speech recognizer; context decision trees; current modalities; dynamic speech modalities; error rate reductions; phone´s context; phonetic classes; signal to noise ratio; speaker gender; speaker´s dialect; speaking rate; speech recognition community; Context modeling; Decision trees; Decoding; Error analysis; Interactive systems; Signal to noise ratio; Speech recognition; Table lookup; Training data; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location
Istanbul
ISSN
1520-6149
Print_ISBN
0-7803-6293-4
Type
conf
DOI
10.1109/ICASSP.2000.861810
Filename
861810
Link To Document