Title :
Techniques for Graph Analytics on Big Data
Author :
Nisar, M. Usman ; Fard, Arash ; Miller, John A.
Author_Institution :
Dept. of Comput. Sci., Univ. of Georgia, Athens, GA, USA
fDate :
June 27 2013-July 2 2013
Abstract :
Graphs enjoy profound importance because of their versatility and expressivity. They can be effectively used to represent social networks, web search engines and genome sequencing. The field of graph pattern matching has been of significant importance and has wide-spread applications. Conceptually, we want to find subgraphs that match a pattern in a given graph. Much work has been done in this field with solutions like Subgraph Isomorphism and Regular Expression matching. With Big Data, scientists are frequently running into massive graphs that have amplified the challenge that this area poses. We study the speedup and communication behavior of three distributed algorithms for inexact graph pattern matching. We also study the impact of different graph partitionings on runtime and network I/O. Our extensive results show that the algorithms exhibit excellent scalable behavior and min-cut partitioning can lead to improved performance under some circumstances, and can drastically reduce the network traffic as well.
Keywords :
computer graphics; data analysis; distributed algorithms; pattern matching; Web search engines; big data; distributed algorithm; genome sequencing; graph analytics; graph partitioning; graph pattern matching; min-cut partitioning; network input-output; regular expression matching; runtime input-output; social networks; subgraph isomorphism; Computational modeling; Data handling; Data models; Distributed algorithms; Global Positioning System; Information management; Pattern matching; graph analytics; big data; graph simulation; parallel and distributed algorithms;
Conference_Titel :
Big Data (BigData Congress), 2013 IEEE International Congress on
Conference_Location :
Santa Clara, CA
Print_ISBN :
978-0-7695-5006-0
DOI :
10.1109/BigData.Congress.2013.78