• DocumentCode
    589136
  • Title

    Comparison of the Efficiency of MapReduce and Bulk Synchronous Parallel Approaches to Large Network Processing

  • Author

    Kajdanowicz, T. ; Indyk, W. ; Kazienko, P. ; Kukul, J.

  • Author_Institution
    Inst. of Inf., Wroclaw Univ. of Technol., Wroclaw, Poland
  • fYear
    2012
  • fDate
    10-10 Dec. 2012
  • Firstpage
    218
  • Lastpage
    225
  • Abstract
    Network structures, especially social networks, grow rapidly and provide huge datasets intractable to analyse. In this paper, two parallel approaches to process large graph structures within the Hadoop environment were compared: Bulk Synchronous Parallel (BSP) and MapReduce (MR). The experimental studies were carried out for two different graph problems: collective classification by means of Relational Influence Propagation (RIP) and Single Source Shortest Path (SSSP) calculation. The appropriate BSP and MapReduce algorithms for these problems were applied to various network datasets differing in size and structural profile, originating from three domains: telecommunication, multimedia and microblog. The collected results revealed that iterative graph processing with BSP implementation significantly outperform popular MapReduce, especially for algorithms with many iterations and sparse communication. However, MapReduce still remains the only alternative for enormous networks.
  • Keywords
    graph theory; parallel processing; pattern classification; social networking (online); BSP; Hadoop environment; MR; MapReduce; RIP; SSSP calculation; bulk synchronous parallel approach; collective classification; efficiency comparison; iterative graph processing; large graph structures; large network processing; microblog; multimedia; relational influence propagation; single source shortest path calculation; social network structures; telecommunication; Big Data; Bulk Synchronous Parallel; Cloud Computing; Collective Classification; Large Graph Processing; MapReduce; Networked Data; Parallel Processing; Shortest Path;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshops (ICDMW), 2012 IEEE 12th International Conference on
  • Conference_Location
    Brussels
  • Print_ISBN
    978-1-4673-5164-5
  • Type

    conf

  • DOI
    10.1109/ICDMW.2012.135
  • Filename
    6406444