Title :
Integrating dynamic speech modalities into context decision trees
Author :
Fügen, Christian ; Rogina, Ivica
Author_Institution :
Interactive Syst. Lab., Karlsruhe Univ., Germany
Abstract :
Context decision trees are widely used in the speech recognition community. Besides questions about the phonetic classes of a phone's context, questions about the phone's position within a word and about the gender of the current speaker have been used so far. In this paper we additionally incorporate questions about current modalities of the spoken utterance, such as the speaker's dialect, the speaking rate, and the signal-to-noise ratio, the latter two of which may change within a single utterance. We present a framework that treats all these modalities in a uniform way. Experiments with the Janus speech recognizer have produced error rate reductions of up to 10% when compared to systems that do not use modality questions.
Keywords :
decision trees; speech recognition; Janus speech recognizer; context decision trees; current modalities; dynamic speech modalities; error rate reductions; phone's context; phonetic classes; signal to noise ratio; speaker gender; speaker's dialect; speaking rate; speech recognition community; Context modeling; Decision trees; Decoding; Error analysis; Interactive systems; Signal to noise ratio; Speech recognition; Table lookup; Training data; Vocabulary
Conference_Title :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
Print_ISBN :
0-7803-6293-4
DOI :
10.1109/ICASSP.2000.861810