DocumentCode
2664628
Title
BC-BSP: A BSP-Based System with Disk Cache for Large-Scale Graph Processing
Author
Yubin Bao ; Zhigang Wang ; Qiushi Bai ; Yu Gu ; Ge Yu ; Hongxu Zhang ; Chao Deng ; Leitao Guo
Author_Institution
Sch. of Inf. Sci & Eng., Northeastern Univ., Shenyang, China
fYear
2012
fDate
19-20 June 2012
Firstpage
35
Lastpage
39
Abstract
Many applications in real life can be modeled by Graph, and the data scale is very large in many fields. People have paid more attention to large-scale graph processing. A BSP-based system with disk cache for large-scale graph processing is proposed in this paper. The system has the ability to expand the functions and strategies (such as adjusting the parameters according to the volume of data and supporting multiple aggregation functions at the same time), to process large-scale data, to balance load, and to run clustering or classification algorithms on metric datasets. Some experiments are done to evaluate the scalability of the system implemented in the paper, and the comparison between BC-BSP-based applications and MapReduce-based ones are made. The experimental results show that BSP-based applications have higher efficiency than the MapReduce-based applications when the volume of data can be put in the memory during the course of processing; on the contrary the latter is better than the former.
Keywords
cache storage; parallel processing; pattern classification; pattern clustering; resource allocation; BC-BSP based system; bulk synchronous parallel; classification algorithm; clustering algorithm; disk cache; large-scale data processing; large-scale graph processing; load balancing; metric datasets; Algorithm design and analysis; Computational modeling; Fault tolerance; Fault tolerant systems; Scalability; Synchronization; Web pages; BSP; Large-Scale Graph Processing; MapReduce;
fLanguage
English
Publisher
ieee
Conference_Titel
Open Cirrus Summit (OCS), 2012 Seventh
Conference_Location
Beijing
Type
conf
DOI
10.1109/OCS.2012.37
Filename
6695837
Link To Document