• DocumentCode
    2681147
  • Title

    Addressing Data-Intensive Computing Problems with the Use of MapReduce on Heterogeneous Environments as Desktop Grid on Slow Links

  • Author

    Anjos, Julio C S ; Kolberg, W. ; Geyer, Claudio R. ; Arantes, Luciana B.

  • Author_Institution
    Inst. de Inf., UFRGS, Porto Alegre, Brazil
  • fYear
    2012
  • fDate
    17-19 Oct. 2012
  • Firstpage
    148
  • Lastpage
    155
  • Abstract
    The emergence of data volumes on the order of petabytes creates the need for new solutions that process data with data-intensive computing systems such as MapReduce. MapReduce is a programming framework that abstracts the parallelization process away from the programmer. However, the model is optimized primarily for large clusters and performs poorly in heterogeneous environments, where machines differ in computational capacity. The motivation of this work is to apply data-intensive computing to heterogeneous environments, such as desktop grids, using the MapReduce model. To address the deficiencies of the MapReduce model in heterogeneous environments, MR-A++ is proposed: a MapReduce with algorithms adapted to heterogeneous environments. The MR-A++ model runs a training task to gather information before the data is distributed, and this information is then used to manage the system. The small delay introduced in the setup phase of the computation is compensated by adapting to the heterogeneous environment according to the computational capacity of the machines. Performance gains can exceed 70% at 10 Mbps.
  • Keywords
    grid computing; parallel processing; MR-A++ model; MapReduce; computational capacity machines; data volumes; data-intensive computing heterogeneous environments; data-intensive computing problems; desktop grid; heterogeneous environments; parallelization process; petabytes; slow links; Adaptation models; Computational modeling; Data models; Delay; Programming; Software; Training; Data-Intensive Computing; Distributed System;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer Systems (WSCAD-SSC), 2012 13th Symposium on
  • Conference_Location
    Petropolis
  • Print_ISBN
    978-1-4673-4468-5
  • Type

    conf

  • DOI
    10.1109/WSCAD-SSC.2012.18
  • Filename
    6391776