Title : 
Post-analysis of Keyword-Based Search Results Using Entity Mining, Linked Data, and Link Analysis at Query Time
         
        
            Author : 
Fafalios, Pavlos ; Tzitzikas, Yannis
         
        
            Author_Institution : 
Inst. of Comput. Sci., FORTH-ICS, Heraklion, Greece
         
        
        
        
        
        
            Abstract : 
The integration of the classical Web (of documents) with the emerging Web of Data is a challenging vision. In this paper we focus on an integration approach during searching which aims at enriching the responses of non-semantic search systems (e.g. professional search systems, web search engines) with semantic information, i.e. Linked Open Data (LOD), and exploiting the outcome for providing an overview of the search space and allowing the users (apart from restricting it) to explore the related LOD. We use named entities (e.g. persons, locations, etc.) as the "glue" for automatically connecting search hits with LOD. We consider a scenario where this entity-based integration is performed at query time with no human effort, and no a-priori indexing, which is beneficial in terms of configurability and freshness. To realize this scenario one has to tackle various challenges. One spiny issue is that the number of identified entities can be high, the same is true for the semantic information about these entities that can be fetched from the available LOD (i.e. their properties and associations with other entities). To this end, in this paper we propose a Link Analysis-based method which is used for (a) ranking (and thus selecting to show) the more important semantic information related to the search results, (b) deriving and showing top-K semantic graphs. In the sequel, we report the results of a survey regarding the marine domain with promising results, and comparative results that illustrate the effectiveness of the proposed (Page Rank-based) ranking scheme. Finally, we report experimental results regarding efficiency showing that the proposed functionality can be offered even at query time.
         
        
            Keywords : 
Internet; data mining; information analysis; query processing; LOD; Page Rank-based ranking scheme; Web of Data; entity mining; keyword-based search results; link analysis; linked open data; named entities; nonsemantic search systems; query time; search space; semantic information; top-K semantic graphs; Engines; Knowledge based systems; Resource description framework; Search problems; Semantics; Web pages; Web search; entity mining; link analysis; linked data; results post-analysis;
         
        
        
        
            Conference_Titel : 
Semantic Computing (ICSC), 2014 IEEE International Conference on
         
        
            Conference_Location : 
Newport Beach, CA
         
        
            Print_ISBN : 
978-1-4799-4002-8
         
        
        
            DOI : 
10.1109/ICSC.2014.11