Title :
A discriminative model based entity dictionary weighting approach for spoken language understanding
Author :
Xiaohu Liu ; Sarikaya, Ruhi
Author_Institution :
Microsoft Corp., Redmond, WA, USA
Abstract :
Spoken language understanding (SLU) systems use various features to detect the domain, intent and semantic slots of a query. In addition to n-grams, features generated from entity dictionaries are often used in model training. Clean or properly weighted dictionaries are critical to improve model´s coverage and accuracy for unseen entities during test time. However, clean dictionaries are hard to obtain for some applications since they are automatically generated and can potentially contain millions of entries (e.g. movie names, person names) with significant noise in them. This paper proposes a discriminative model based approach to weight entities in noisy dictionaries using multiple knowledge resources. The model makes use of features extracted from query click logs, knowledge graph and live search results for accurate entity weighting. Experiments for both intent detection and slots tagging tasks in entertainment search covering five domains show significant gains over the baselines.
Keywords :
feature extraction; natural language processing; query processing; SLU systems; discriminative model based entity dictionary weighting approach; feature extraction; intent detection; knowledge graph; live search results; noisy dictionaries; query click logs; slot tagging tasks; spoken language understanding; Dictionaries; Feature extraction; Motion pictures; Noise; Search engines; Semantics; Support vector machines; Spoken language understanding; knowledge graphs; named entity lists; query click logs;
Conference_Titel :
Spoken Language Technology Workshop (SLT), 2014 IEEE
DOI :
10.1109/SLT.2014.7078573