DocumentCode
8178
Title
Maiter: An Asynchronous Graph Processing Framework for Delta-Based Accumulative Iterative Computation
Author
Yanfeng Zhang ; Qixin Gao ; Lixin Gao ; Cuirong Wang
Author_Institution
Northeastern Univ., Shenyang, China
Volume
25
Issue
8
fYear
2014
fDate
Aug. 2014
Firstpage
2091
Lastpage
2100
Abstract
Myriad of graph-based algorithms in machine learning and data mining require parsing relational data iteratively. These algorithms are implemented in a large-scale distributed environment to scale to massive data sets. To accelerate these large-scale graph-based iterative computations, we propose delta-based accumulative iterative computation (DAIC). Different from traditional iterative computations, which iteratively update the result based on the result from the previous iteration, DAIC updates the result by accumulating the “changes” between iterations. By DAIC, we can process only the “changes” to avoid the negligible updates. Furthermore, we can perform DAIC asynchronously to bypass the high-cost synchronous barriers in heterogeneous distributed environments. Based on the DAIC model, we design and implement an asynchronous graph processing framework, Maiter. We evaluate Maiter on local cluster as well as on Amazon EC2 Cloud. The results show that Maiter achieves as much as 60 × speedup over Hadoop and outperforms other state-of-the-art frameworks.
Keywords
cloud computing; data mining; graph theory; iterative methods; learning (artificial intelligence); Amazon EC2 cloud; DAIC; Hadoop; Maiter framework; asynchronous graph processing framework; data mining; delta-based accumulative iterative computation; large-scale distributed environment; machine learning; relational data parsing; Acceleration; Convergence; Educational institutions; Equations; Iterative methods; Processor scheduling; Synchronization; Delta-based accumulative iterative computation; asynchronous iteration; distributed framework; maiter;
fLanguage
English
Journal_Title
Parallel and Distributed Systems, IEEE Transactions on
Publisher
ieee
ISSN
1045-9219
Type
jour
DOI
10.1109/TPDS.2013.235
Filename
6600686
Link To Document