Title :
Employing web search query click logs for multi-domain spoken language understanding
Author :
Hakkani-Tür, Dilek ; Tur, Gokhan ; Heck, Larry ; Celikyilmaz, Asli ; Fidler, Ashley ; Hillard, Dustin ; Iyer, Rukmini ; Parthasarathy, Sarangarajan
Author_Institution :
Speech Labs., Microsoft, Mountain View, CA, USA
Abstract :
Logs of user queries from a search engine (such as Bing or Google) together with the links clicked provide valuable implicit feedback to improve statistical spoken language understanding (SLU) models. In this work, we propose to enrich the existing classification feature set for domain detection with features computed using the click distribution over a set of clicked URLs from search query click logs (QCLs) of user utterances. Since the form of natural language utterances differs stylistically from that of keyword search queries, to be able to match natural language utterances with related search queries, we perform a syntax-based transformation of the original utterances, after filtering out domain-independent salient phrases. This approach results in significant improvements for domain detection, especially when detecting the domains of web-related user utterances.
Keywords :
Internet; filtering theory; natural language processing; query formulation; search engines; signal classification; speech processing; Bing; Google; Web search query click logs; Web-related user utterances; classification feature set; click distribution; clicked URL; domain detection; domain-independent salient phrases filtering; implicit feedback; keyword search query; multidomain spoken language understanding; natural language utterance matching; natural language utterances; search engine; statistical spoken language understanding model; syntax-based transformation; Error analysis; Feature extraction; Keyword search; Natural languages; Quantum cascade lasers; Training; Web search;
Conference_Titel :
Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
Conference_Location :
Waikoloa, HI
Print_ISBN :
978-1-4673-0365-1
Electronic_ISBN :
978-1-4673-0366-8
DOI :
10.1109/ASRU.2011.6163968