• DocumentCode
    607273
  • Title

    Performance of scalable off-the-shelf hardware for data-intensive parallel processing using MapReduce

  • Author

    Ahmad Fadzil, Ahmad Firdaus ; Abdul Khalid, Noor Elaiza ; Manaf, Mazani

  • Author_Institution
    Fac. of Comput. & Math. Sci., Univ. Teknol. MARA (UiTM), Shah Alam, Malaysia
  • fYear
    2012
  • fDate
    3-5 Dec. 2012
  • Firstpage
    379
  • Lastpage
    384
  • Abstract
    Large data and information processing requires high processing power that usually involve supercomputers which are costly. MapReduce parallel framework introduces an automated way of distributing these large processes to many computers. This paper proposes to conduct preliminary studies on scalability using MapReduce as an automated parallel processing running on low-cost off-the-shelf hardware. The system architecture is built with collections of off-the-shelf hardware. The scalability test will be conducted by adding an off-the-shelf hardware one at a time to the architecture. MapReduce tool is used as a parallel framework to automatically distribute tasks according to available resources. Performance will be evaluated based on improvement in speedup. It is found that MapReduce is able to accommodate scalability of off-the-shelf hardware resources by automatically distributing tasks regardless of the number of hardware being added to the architecture.
  • Keywords
    parallel processing; MapReduce tool; automated parallel processing; data-intensive parallel processing; information processing; off-the-shelf hardware scalability; supercomputers; system architecture; MapReduce; Off-the-shelf hardware; Parallel processing; scalability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computing and Convergence Technology (ICCCT), 2012 7th International Conference on
  • Conference_Location
    Seoul
  • Print_ISBN
    978-1-4673-0894-6
  • Type

    conf

  • Filename
    6530362