DocumentCode :
3423807
Title :
Machine Learning for Question Answering from Tabular Data
Author :
Khalid, Mahboob Alam ; Jijkoun, Valentin ; de Rijke, Maarten
Author_Institution :
Univ. of Amsterdam, Amsterdam
fYear :
2007
fDate :
3-7 Sept. 2007
Firstpage :
392
Lastpage :
396
Abstract :
Question Answering (QA) systems automatically answer natural language questions in a human-like manner. One practical approach to open-domain QA consists of extracting facts from free text offline and using a lookup mechanism when answering users' questions online. This approach is related to natural language interfaces to databases (NLIDBs), which were studied extensively from the 1970s to the 1990s. NLIDB systems employed a range of techniques, from simple pattern-matching rules to formal logical calculi such as the lambda calculus, but most were restricted to specific domains. In this paper we describe a machine learning approach to querying tabular data for QA that is not restricted to specific domains. Our approach consists of two steps: for an incoming question, we first use a classifier to identify appropriate tables and columns in a structured database, and then employ free-text retrieval to look up answers. The system uses part-of-speech tagging, named-entity normalization and a statistical classifier trained on data from the TREC QA task. On the TREC QA data, our system significantly outperforms an existing rule-based table lookup method.
Keywords :
information retrieval; learning (artificial intelligence); natural language interfaces; pattern matching; formal logical calculi; free-text retrieval; human-like manner; lambda calculus; lookup mechanism; machine learning; natural language interfaces; natural language questions; part-of-speech tagging; pattern-matching rules; question answering systems; rule-based table lookup method; structured database; tabular data; Calculus; Data mining; Databases; Expert systems; Information retrieval; Machine learning; Natural languages; Table lookup; Tagging; Text analysis;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Database and Expert Systems Applications, 2007. DEXA '07. 18th International Workshop on
Conference_Location :
Regensburg
ISSN :
1529-4188
Print_ISBN :
978-0-7695-2932-5
Type :
conf
DOI :
10.1109/DEXA.2007.119
Filename :
4312923