DocumentCode :
2545602
Title :
Combining N-gram retrieval with weights propagation on massive RDF graphs
Author :
Hu, He ; Du, Xiaoyong
fYear :
2012
fDate :
29-31 May 2012
Firstpage :
1181
Lastpage :
1185
Abstract :
N-gram approach takes the position information into account additionally and thus can offer higher accuracy in query answering than keyword based approaches and is widely used in IR and NLP. However, in large-scale RDF graphs, URIs instead of documents are the ranking and querying units; URIs are usually much shorter than documents, and different URIs are interlinked into a massive network. One shot n-gram querying is usually not good for the RDF data in many cases. In this paper, we present a hybrid framework which combines the n-gram retrieval with link analysis based weight propagation. The idea is to exploit the link structures in the RDF data graphs and propagate the one shot n-gram score weights along with these links. Large scale experiments using MapReduce on Billion Triples Challenge dataset show the hybrid framework achieves an 80.3% improvement in relevance scores over mere n-gram retrieval.
Keywords :
data structures; graph theory; information retrieval; Billion Triples Challenge dataset; MapReduce; N-gram retrieval; RDF data graphs; large-scale RDF graphs; massive RDF graphs; weights propagation; Conferences; Educational institutions; Indexes; Legged locomotion; Resource description framework; Semantics; USA Councils; Linked Data; MapReduce; N-gram; RDF Graphs;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2012 9th International Conference on
Conference_Location :
Sichuan
Print_ISBN :
978-1-4673-0025-4
Type :
conf
DOI :
10.1109/FSKD.2012.6233970
Filename :
6233970
Link To Document :
بازگشت