Title :
Query understanding enhanced by hierarchical parsing structures
Author :
Jingjing Liu ; Pasupat, Panupong ; Yining Wang ; Cyphers, Scott ; Glass, James
Author_Institution :
MIT Comput. Sci. & Artificial Intell. Lab., Cambridge, MA, USA
Abstract :
Query understanding has been well studied in the areas of information retrieval and spoken language understanding (SLU). There are generally three layers of query understanding: domain classification, user intent detection, and semantic tagging. Classifiers can be applied to domain and intent detection in real systems, and semantic tagging (or slot filling) is commonly defined as a sequence-labeling task - mapping a sequence of words to a sequence of labels. Various statistical features (e.g., n-grams) can be extracted from annotated queries for learning label prediction models; however, linguistic characteristics of queries, such as hierarchical structures and semantic relationships, are usually neglected in the feature extraction process. In this work, we propose an approach that leverages linguistic knowledge encoded in hierarchical parse trees for query understanding. Specifically, for natural language queries, we extract a set of syntactic structural features and semantic dependency features from query parse trees to enhance inference model learning. Experiments on real natural language queries show that augmenting sequence labeling models with linguistic knowledge can improve query understanding performance in various domains.
Keywords :
computational linguistics; feature extraction; grammars; natural language processing; query processing; statistical analysis; SLU; domain classification; feature extraction; hierarchical parse trees; hierarchical parsing structure; inference model learning; information retrieval; label prediction model; linguistic characteristic; linguistic knowledge; n-grams; natural language queries; query parse trees; query understanding; semantic dependency feature; semantic tagging; sequence-labeling task; slot filling; spoken language understanding; statistical feature; syntactic structural feature; user intent detection; Feature extraction; Motion pictures; Natural languages; Pragmatics; Semantics; Syntactics; Tagging; linguistic parsing; query understanding; semantic tagging;
Conference_Titel :
Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on
Conference_Location :
Olomouc
DOI :
10.1109/ASRU.2013.6707708