Title :
Combining N-gram retrieval with weights propagation on massive RDF graphs
Author :
Hu, He ; Du, Xiaoyong
Abstract :
N-gram approach takes the position information into account additionally and thus can offer higher accuracy in query answering than keyword based approaches and is widely used in IR and NLP. However, in large-scale RDF graphs, URIs instead of documents are the ranking and querying units; URIs are usually much shorter than documents, and different URIs are interlinked into a massive network. One shot n-gram querying is usually not good for the RDF data in many cases. In this paper, we present a hybrid framework which combines the n-gram retrieval with link analysis based weight propagation. The idea is to exploit the link structures in the RDF data graphs and propagate the one shot n-gram score weights along with these links. Large scale experiments using MapReduce on Billion Triples Challenge dataset show the hybrid framework achieves an 80.3% improvement in relevance scores over mere n-gram retrieval.
Keywords :
data structures; graph theory; information retrieval; Billion Triples Challenge dataset; MapReduce; N-gram retrieval; RDF data graphs; large-scale RDF graphs; massive RDF graphs; weights propagation; Conferences; Educational institutions; Indexes; Legged locomotion; Resource description framework; Semantics; USA Councils; Linked Data; MapReduce; N-gram; RDF Graphs;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2012 9th International Conference on
Conference_Location :
Sichuan
Print_ISBN :
978-1-4673-0025-4
DOI :
10.1109/FSKD.2012.6233970