Title :
BC-BSP: A BSP-Based System with Disk Cache for Large-Scale Graph Processing
Author :
Yubin Bao ; Zhigang Wang ; Qiushi Bai ; Yu Gu ; Ge Yu ; Hongxu Zhang ; Chao Deng ; Leitao Guo
Author_Institution :
Sch. of Inf. Sci. & Eng., Northeastern Univ., Shenyang, China
Abstract :
Many real-life applications can be modeled as graphs, and in many fields the data scale is very large, so large-scale graph processing has attracted growing attention. This paper proposes a BSP-based system with disk cache for large-scale graph processing. The system supports extensible functions and strategies (such as adjusting parameters according to the volume of data and supporting multiple aggregation functions at the same time), processes large-scale data, balances load, and runs clustering or classification algorithms on metric datasets. Experiments evaluate the scalability of the implemented system and compare BC-BSP-based applications with MapReduce-based ones. The experimental results show that BSP-based applications are more efficient than MapReduce-based applications when the data fits in memory during processing; otherwise, the MapReduce-based applications perform better.
Keywords :
cache storage; parallel processing; pattern classification; pattern clustering; resource allocation; BC-BSP based system; bulk synchronous parallel; classification algorithm; clustering algorithm; disk cache; large-scale data processing; large-scale graph processing; load balancing; metric datasets; Algorithm design and analysis; Computational modeling; Fault tolerance; Fault tolerant systems; Scalability; Synchronization; Web pages; BSP; Large-Scale Graph Processing; MapReduce;
Conference_Title :
Open Cirrus Summit (OCS), 2012 Seventh
Conference_Location :
Beijing
DOI :
10.1109/OCS.2012.37