Title : 
Question Answering over Implicitly Structured Web Content
         
        
            Author : 
Agichtein, Eugene ; Burges, Chris ; Brill, Eric
         
        
        
        
        
        
            Abstract : 
Implicitly structured content on the Web such as HTML tables and lists can be extremely valuable for web search, question answering, and information retrieval, as the implicit structure in a page often reflects the underlying semantics of the data. Unfortunately, exploiting this information presents significant challenges due to the immense amount of implicitly structured content on the web, lack of schema information, and unknown source quality. We present TQA, a web-scale system for automatic question answering that is often able to find answers to real natural language questions from the implicitly structured content on the web. Our experiments over more than 200 million structures extracted from a partial web crawl demonstrate the promise of our approach.
         
        
            Keywords : 
Content based retrieval; Data mining; HTML; History; Information retrieval; Intelligent structures; Natural languages; Search engines; Statistics; Web search;
         
        
        
        
            Conference_Titel : 
Web Intelligence, IEEE/WIC/ACM International Conference on
         
        
            Conference_Location : 
Fremont, CA
         
        
            Print_ISBN : 
978-0-7695-3026-0
         
        
        
            DOI : 
10.1109/WI.2007.130