• DocumentCode
    865710
  • Title

    Out-of-Domain Utterance Detection Using Classification Confidences of Multiple Topics

  • Author

    Lane, Ian ; Kawahara, Tatsuya ; Matsui, Tomoko ; Nakamura, Satoshi

  • Author_Institution
    ATR Spoken Language Commun. Res. Labs., Kyoto
  • Volume
    15
  • Issue
    1
  • fYear
    2007
  • Firstpage
    150
  • Lastpage
    161
  • Abstract
    One significant problem for spoken language systems is how to cope with users\´ out-of-domain (OOD) utterances which cannot be handled by the back-end application system. In this paper, we propose a novel OOD detection framework, which makes use of the classification confidence scores of multiple topics and applies a linear discriminant model to perform in-domain verification. The verification model is trained using a combination of deleted interpolation of the in-domain data and minimum-classification-error training, and does not require actual OOD data during the training process, thus realizing high portability. When applied to the "phrasebook" system, a single utterance read-style speech task, the proposed approach achieves an absolute reduction in OOD detection errors of up to 8.1 points (40% relative) compared to a baseline method based on the maximum topic classification score. Furthermore, the proposed approach realizes comparable performance to an equivalent system trained on both in-domain and OOD data, while requiring no OOD data during training. We also apply this framework to the "machine-aided-dialogue" corpus, a spontaneous dialogue speech task, and extend the framework in two manners. First, we introduce topic clustering which enables reliable topic confidence scores to be generated even for indistinct utterances, and second, we implement methods to effectively incorporate dialogue context. Integration of these two methods into the proposed framework significantly improves OOD detection performance, achieving a further reduction in equal error rate (EER) of 7.9 points
  • Keywords
    natural language processing; speech recognition; in-domain verification; linear discriminant model; machine-aided-dialogue corpus; minimum-classification-error training; out-of-domain utterance detection; phrasebook system; single utterance read-style speech task; speech recognition; spoken language understanding; Communications technology; Error analysis; Informatics; Interpolation; Laboratories; Mathematical model; Mathematics; Natural language processing; Natural languages; Speech recognition; Confidence measures; out-of-domain (OOD) utterance detection; speech recognition; spoken language understanding; topic classification; topic clustering;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2006.876727
  • Filename
    4032779