• DocumentCode
    381271
  • Title

    Language models beyond word strings

  • Author

    Noth, Elmar ; Batliner, Anton ; Niemann, Heinrich ; Stemmer, Georg ; Gallwitz, Florian ; Spilker, J.

  • Author_Institution
    Lehrstuhl fur Mustererkennung, Erlangen-Nurnberg Univ., Erlangen, Germany
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    167
  • Lastpage
    176
  • Abstract
    In this paper we want to show how n-gram language models can be used to provide additional information in automatic speech understanding systems beyond the pure word chain. This becomes important in the context of conversational dialogue systems that have to recognize and interpret spontaneous speech. We show how n-grams can: (1) help to classify prosodic events like boundaries and accents; (2) be extended to directly provide boundary information in the speech recognition phase; (3) help to process speech repairs; and (4) detect and semantically classify out-of-vocabulary words. The approaches can work on the best word chain or a word hypotheses graph. Examples and experimental results are provided from our own research within the EVAR information retrieval system and the VERBMOBIL speech-to-speech translation system.
  • Keywords
    interactive systems; natural language interfaces; speech recognition; EVAR information retrieval system; VERBMOBIL speech-to-speech translation system; accents; automatic speech understanding systems; boundary information; conversational dialogue systems; language models; n-gram language models; out-of-vocabulary words; prosodic event classification; semantic classification; speech recognition; speech repairs; word chain; word hypotheses graph; word strings; Automatic speech recognition; Databases; Event detection; Natural languages; Phase detection; Speech analysis; Speech processing; Speech recognition; Stochastic systems; Virtual manufacturing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on
  • Print_ISBN
    0-7803-7343-X
  • Type

    conf

  • DOI
    10.1109/ASRU.2001.1034614
  • Filename
    1034614