Title :
Parallelized Similarity Flooding Algorithm for Processing Large Scale Graph Datasets with MapReduce
Author :
Jian Zhang ; Chunfeng Yuan ; Yihua Huang
Author_Institution :
Dept. of Comput. Sci. & Technol., Nanjing Univ., Nanjing, China
Abstract :
Measures of graph similarity have a broad range of applications but involve compute-intensive process. Similarity flooding algorithm is an efficient algorithm for comparing the similarity of graphs of small size and small datasets. However, nowadays more and more large-scale graph applications emerge and existing stand-alone similarity flooding algorithm cannot efficiently conduct the similarity comparison process for large scale graph datasets in acceptable time. This paper presents a parallelized similarity flooding algorithm with MapReduce for large-scale graph datasets. The experimental results demonstrate that the parallelized algorithm achieves significant performance improvement compared to the stand-alone similarity flooding algorithm. Experimental results also reveal that the parallelized algorithm can obtain excellent speedup when the size of cluster increases.
Keywords :
data handling; graph theory; parallel algorithms; MapReduce; cluster size; compute-intensive process; graph similarity; large-scale graph datasets; parallelized similarity flooding algorithm; Algorithm design and analysis; Clustering algorithms; Electronic publishing; Floods; Information services; Internet; Software algorithms; MapReduce; large-scale graph data; parallelized algorithm; similarity flooding algorithm;
Conference_Titel :
Parallel and Distributed Computing, Applications and Technologies (PDCAT), 2012 13th International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-0-7695-4879-1
DOI :
10.1109/PDCAT.2012.109