• DocumentCode
    3442907
  • Title

    Active learning for spoken language understanding

  • Author

    Tur, Gokhan ; Schapire, Robert E. ; Hakkani-Tur, Dilek

  • Author_Institution
    AT&T Labs.-Res., USA
  • Volume
    1
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    We describe active learning methods for reducing the labeling effort in a statistical call classification system. Active learning aims to minimize the number of labeled utterances by automatically selecting for labeling the utterances that are likely to be most informative. The first method, inspired by certainty-based active learning, selects the examples that the classifier is least confident about. The second method, inspired by committee-based active learning, selects the examples that multiple classifiers do not agree on. We have evaluated these active learning methods using a call classification system used for AT&T customer care. Our results indicate that it is possible to reduce human labeling effort at least by a factor of two.
  • Keywords
    learning (artificial intelligence); natural languages; signal classification; speech recognition; statistical analysis; AT&T customer care; call classification system; certainty-based active learning; committee-based active learning; human labeling effort reduction; labeling effort reduction; learning methods; multiple classifiers; spoken language understanding; spoken natural language; statistical call classification system; voice-based natural dialog systems; Computer science; Cost function; Humans; Labeling; Learning systems; Natural languages; Sampling methods; Sorting; Speech; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1198771
  • Filename
    1198771