Title :
Extracting answers to natural language questions from large-scale corpus
Author :
Li, Peng ; Wang, Xiaolong ; Guan, Yi ; Zhao, YuMing
Author_Institution :
Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., China
fDate :
30 Oct.-1 Nov. 2005
Abstract :
This paper provides a novel and tractable method for extracting exact textual answers from the returned documents that are retrieved by traditional IR system in large-scale collection of texts. In our approach, WordNet and Web information are employed to improve the performance as external auxiliary resources, then some NLP technologies are used to constitute the empirical answer ranking formula, such as POS tagging, Named Entity recognition, and parser etc. The method involves automatically ranking passages with System Similarity Model, automatically downloading related Web pages by means of Web crawler, and automatically mining answers with empirical formula from candidate answer sets. The series of experimental results show that the overall performance of our system is good and the structure of the system is reasonable.
Keywords :
Internet; data mining; information retrieval; natural languages; text analysis; NLP technology; Named Entity recognition; POS tagging; System Similarity Model; Web pages; information retrieval system; large-scale corpus; natural language question; parser; textual answer extraction; Computer science; Crawlers; Data mining; Information retrieval; Large-scale systems; Moon; Natural language processing; Natural languages; Tagging; Web pages;
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
Print_ISBN :
0-7803-9361-9
DOI :
10.1109/NLPKE.2005.1598824