DocumentCode
607273
Title
Performance of scalable off-the-shelf hardware for data-intensive parallel processing using MapReduce
Author
Ahmad Fadzil, Ahmad Firdaus ; Abdul Khalid, Noor Elaiza ; Manaf, Mazani
Author_Institution
Fac. of Comput. & Math. Sci., Univ. Teknol. MARA (UiTM), Shah Alam, Malaysia
fYear
2012
fDate
3-5 Dec. 2012
Firstpage
379
Lastpage
384
Abstract
Large data and information processing requires high processing power that usually involve supercomputers which are costly. MapReduce parallel framework introduces an automated way of distributing these large processes to many computers. This paper proposes to conduct preliminary studies on scalability using MapReduce as an automated parallel processing running on low-cost off-the-shelf hardware. The system architecture is built with collections of off-the-shelf hardware. The scalability test will be conducted by adding an off-the-shelf hardware one at a time to the architecture. MapReduce tool is used as a parallel framework to automatically distribute tasks according to available resources. Performance will be evaluated based on improvement in speedup. It is found that MapReduce is able to accommodate scalability of off-the-shelf hardware resources by automatically distributing tasks regardless of the number of hardware being added to the architecture.
Keywords
parallel processing; MapReduce tool; automated parallel processing; data-intensive parallel processing; information processing; off-the-shelf hardware scalability; supercomputers; system architecture; MapReduce; Off-the-shelf hardware; Parallel processing; scalability;
fLanguage
English
Publisher
ieee
Conference_Titel
Computing and Convergence Technology (ICCCT), 2012 7th International Conference on
Conference_Location
Seoul
Print_ISBN
978-1-4673-0894-6
Type
conf
Filename
6530362
Link To Document