DocumentCode
3165459
Title
A statistical classification approach to question answering using Web data
Author
Whittaker, Edward ; Furui, Sadaoki ; Klakow, Dietrich
Author_Institution
Dept. of Comput. Sci., Tokyo Inst. of Technol.
fYear
2005
fDate
23-25 Nov. 2005
Lastpage
428
Abstract
In this paper we treat question answering (QA) as a classification problem. Our motivation is to build systems for many languages without the need for highly tuned linguistic modules. Consequently, word tokens and Web data are used extensively but no explicit linguistic knowledge is incorporated. A mathematical model for answer retrieval, answer classification and answer length prediction is derived. The TREC 2002 QA task is used for system development where 33% of questions are answered correctly. Performance is then evaluated on the factoid questions of the TREC 2003 QA task where 23% of questions were answered correctly, which would rank the system in the top 10 of contemporary QA systems on the same task
Keywords
Internet; classification; information retrieval; natural languages; statistical analysis; QA systems; TREC; Web data; answer classification; answer length prediction; answer retrieval; classification problem; factoid questions; question answering; statistical classification; Computer science; Data mining; Information analysis; Mathematical model; Robustness; Search engines; Statistical analysis; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Cyberworlds, 2005. International Conference on
Conference_Location
Singapore
Print_ISBN
0-7695-2378-1
Type
conf
DOI
10.1109/CW.2005.10
Filename
1587573
Link To Document