Title :
Distributed scalable RDFS reasoning
Author :
Jagvaral, Batselem ; Young-Tack Park
Author_Institution :
Comput. Sci. Dept., Soongsil Univ., Seoul, South Korea
Abstract :
A number of reasoning studies on big ontology have been carried out in the recent years. However, most of the existing studies have focused heavily on Hadoop MapReduce. In this paper, we propose a reasoning approach for Resource Description Framework Schema (RDFS) that employs optimized methods based on Spark. Spark is a general distributed inmemory framework for large-scale data processing that is not tied to the two-stage MapReduce paradigm. In our work, we devised an extensive optimization method to cope with the communication bottleneck of data shuffling between machine nodes in a distributed system. From empirical evaluations, the proposed reasoning system produces at most the throughput of 4166KT/sec which is almost 80% faster than the MapReduce based reasoner WebPIE.
Keywords :
Internet; data handling; inference mechanisms; ontologies (artificial intelligence); optimisation; Hadoop MapReduce; MapReduce based reasoner WebPIE; Spark; data shuffling; distributed scalable RDFS reasoning; extensive optimization method; general distributed inmemory framework; large-scale data processing; machine nodes; ontology; resource description framework schema; Big data; Cognition; Distributed databases; Ontologies; Resource description framework; Sparks; Throughput; Distributed System; Ontology Reasoning; RDF; RDFS; Spark;
Conference_Titel :
Big Data and Smart Computing (BigComp), 2015 International Conference on
Conference_Location :
Jeju
DOI :
10.1109/35021BIGCOMP.2015.7072845