• DocumentCode
    353511
  • Title

    Integrating dynamic speech modalities into context decision trees

  • Author

    Fügen, Christian ; Rogina, Ivica

  • Author_Institution
    Interactive Syst. Lab., Karlsruhe Univ., Germany
  • Volume
    3
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    1277
  • Abstract
    Context decision trees are widely used in the speech recognition community. Besides questions about phonetic classes of a phone´s context, questions about their position within a word and questions about the gender of the current speaker have been used so far. In this paper we additionally incorporate questions about current modalities of the spoken utterance like the speaker´s dialect, the speaking rate, the signal to noise ratio, the latter two of which may change while speaking one utterance. We present a framework that treats all these modalities in a uniform way. Experiments with the Janus speech recognizer have produced error rate reductions of up to 10% when compared to systems that do not use modality questions
  • Keywords
    decision trees; speech recognition; Janus speech recognizer; context decision trees; current modalities; dynamic speech modalities; error rate reductions; phone´s context; phonetic classes; signal to noise ratio; speaker gender; speaker´s dialect; speaking rate; speech recognition community; Context modeling; Decision trees; Decoding; Error analysis; Interactive systems; Signal to noise ratio; Speech recognition; Table lookup; Training data; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-6293-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2000.861810
  • Filename
    861810