• DocumentCode
    417186
  • Title

    Extending boosting for call classification using word confusion networks

  • Author

    Tur, Gokhan ; Hakkani-Tür, Dilek ; Riccardi, Giuseppe

  • Author_Institution
    AT&T Labs.-Res., USA
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    We are interested in the problem of robust understanding from noisy spontaneous speech input. In goal driven human-machine dialog, utterance classification is a key component of the understanding process to determine the intent of the speaker. We propose a novel algorithm for exploiting ASR word confidence scores for better classification of spoken utterances. Word confidence scores for automatic speech recognition (ASR) provide estimates for word error rates. While previous work has focused on straightforward combination of word confidence scores into Bayesian classifiers, we extend the mathematical formulation for boosting classifiers. This extension of the algorithm allows confidence scores to be exploited from a 1-best ASR output or from word confusion networks (WCNs). We present methods for on-line and off-line score combinations. The results we show are for a large database of utterances collected using the AT&T VoiceToneSM spoken dialog system. Our experiments show between 5% and 10% reduction in error (1-precision) for a given recall using WCNs compared to ASR output.
  • Keywords
    error statistics; interactive systems; natural language interfaces; speech recognition; speech-based user interfaces; ASR word confidence scores; AT&T VoiceTone; Bayesian classifiers; boosting algorithm; boosting classifiers; call classification; human-machine dialog; noisy spontaneous speech; robust understanding; spoken dialog system; spoken utterance classification; word confusion networks; word error rate estimation; Automatic speech recognition; Bayesian methods; Boosting; Databases; Decoding; Error analysis; Iterative algorithms; Lattices; Man machine systems; Robustness;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326016
  • Filename
    1326016