• DocumentCode
    1146696
  • Title

    Discriminative training of natural language call routers

  • Author

    Kuo, Hong-Kwang Jeff ; Lee, Chin-Hui

  • Author_Institution
    IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
  • Volume
    11
  • Issue
    1
  • fYear
    2003
  • fDate
    1/1/2003 12:00:00 AM
  • Firstpage
    24
  • Lastpage
    35
  • Abstract
    This paper shows how discriminative training can significantly improve classifiers used in natural language processing, using as an example the task of natural language call routing, where callers are transferred to desired departments based on natural spoken responses to an open-ended "How may I direct your call?" prompt. With vector-based natural language call routing, callers are transferred using a routing matrix trained on statistics of occurrence of words and word sequences in a training corpus. By re-training the routing matrix parameters using a minimum classification error criterion, a relative error rate reduction of 10-30% was achieved on a banking task. Increased robustness was demonstrated in that with 10% rejection, the error rate was reduced by 40%. Discriminative training also improves portability; we were able to train call routers with the highest known performance using as input only text transcription of routed calls, without any human intervention or knowledge about what terms are important or irrelevant for the routing task. This strategy was validated with both the banking task and a more difficult task involving calls to operators in the UK. The proposed formulation is applicable to algorithms addressing a broad range of speech understanding, information retrieval, and topic identification problems.
  • Keywords
    information retrieval; natural languages; signal classification; speech processing; telecommunication network routing; UK; banking; discriminative training; information retrieval; minimum classification error criterion; natural language processing; natural spoken responses; portability; relative error rate reduction; routing matrix parameters; speech understanding; text transcription; topic identification; training corpus; vector-based natural language call routing; word sequences; words occurrence statistics; Banking; Error analysis; Humans; Information retrieval; Natural language processing; Natural languages; Robustness; Routing; Speech; Statistics;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/TSA.2002.807352
  • Filename
    1179375