DocumentCode :
2727049
Title :
Question Answering over Implicitly Structured Web Content
Author :
Agichtein, Eugene ; Burges, Chris ; Brill, Eric
fYear :
2007
fDate :
2-5 Nov. 2007
Firstpage :
18
Lastpage :
25
Abstract :
Implicitly structured content on the Web such as HTML tables and lists can be extremely valuable for web search, question answering, and information retrieval, as the implicit structure in a page often reflects the underlying semantics of the data. Unfortunately, exploiting this information presents significant challenges due to the immense amount of implicitly structured content on the web, lack of schema information, and unknown source quality. We present TQA, a web-scale system for automatic question answering that is often able to find answers to real natural language questions from the implicitly structured content on the web. Our experiments over more than 200 million structures extracted from a partial web crawl demonstrate the promise of our approach.
Keywords :
Content based retrieval; Data mining; HTML; History; Information retrieval; Intelligent structures; Natural languages; Search engines; Statistics; Web search;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Intelligence, IEEE/WIC/ACM International Conference on
Conference_Location :
Fremont, CA
Print_ISBN :
978-0-7695-3026-0
Type :
conf
DOI :
10.1109/WI.2007.130
Filename :
4427061
Link To Document :
بازگشت