Title :
Applying MapReduce principle to high level information fusion
Author :
Laudy, Claire ; Dreo, Johann ; Gouguenheim, Christophe
Abstract :
The InSyTo Synthesis framework is based on graph structures, graph algorithms and similarity measures for soft data fusion managing inconsistencies. The framework can be used to enable non-redundant additions to an information network, as well as graph based information query on several applications. The graph fusion algorithm relies on the search of a maximum common subgraph isomorphism, which makes it a difficult problem, especially on large graphs. In this work, the subgraph matching algorithm is partially parallelized, based on the MapReduce approach and on the Hadoop framework. Using Hadoop enables the management of big graphs, first by avoiding the load of the graphs in memory and secondly by distributing the computations over several processing nodes. Our experiments on the Global Terrorism Database (which contains the descriptions of more than 113,000 terrorist attacks in a graph of more than 20,000,000 nodes) shows that InSyTo Synthesis now scales to so-called "big data" applications.
Keywords :
Big Data; graph theory; parallel programming; sensor fusion; Big Data"applications; Hadoop framework; InSyTo Synthesis framework; MapReduce approach; MapReduce principle; big graph management; global terrorism database; graph based information query; graph fusion algorithm; graph structures; high level information fusion; information network; processing nodes; similarity measures; soft data fusion; subgraph isomorphism; subgraph matching algorithm; Big data; Context; Data integration; Data models; Semantics; Terrorism; Vocabulary;
Conference_Titel :
Information Fusion (FUSION), 2014 17th International Conference on
Conference_Location :
Salamanca