DocumentCode :
3481756
Title :
Map reduction framework for parallel data mining: Multicore to distributed network systems
Author :
Ramakrishna, Rahul ; Rao, M. V Bhaskara
Author_Institution :
Comput. Sci., Sir M Visvesvaraya Inst. of Technol., Bangalore, India
fYear :
2011
fDate :
5-6 Dec. 2011
Firstpage :
375
Lastpage :
379
Abstract :
In this multi core era, there is a huge influx of symmetric multi-process computers based on shared memory architecture and high end server platforms. It appears no adequate framework exists to manifest the complete potential of the hardware. In this paper a parallel programming framework is demonstrated applicable to different algorithms in a distinctive way from the conventional single algorithm speedup at a particular point of time. The framework fosters application dependent speedup over uniprocessor applications for a given workload and even on small Ethernet/IP based networks. Functional programming paradigm has the ability to implicitly parallelize program to multicore computers and scaled in distributed networks using a message queues. Also the map reduction framework is based on functional programming paradigm, where the programs can be written in summation form, specifying a map function which generates intermediate key value pairs and a reduce function merging the key value pairs. With this method a substantial increase in speed efficiency is obtained. However, the framework by itself will not substantially increase the speed of execution, as other parameters like chunking of data affect the performance metrics. Graphical methods are used and explained in order to show the optimum amount of chunking to be used for execution.
Keywords :
IP networks; data mining; functional programming; local area networks; parallel programming; shared memory systems; Ethernet-IP based networks; application dependent speedup; data chunking; distributed network systems; functional programming paradigm; graphical methods; high end server platforms; key value pairs; map reduction framework; message queues; multicore computers; multicore systems; parallel data mining; parallel programming framework; performance metrics; shared memory architecture; symmetric multiprocess computers; uniprocessor applications; Clustering algorithms; Conferences; Functional programming; Hardware; Multicore processing; Pipelines; Functional programming; map-reduce; message queues; speed efficiency; symmetric multi-process computers;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Humanities, Science and Engineering (CHUSER), 2011 IEEE Colloquium on
Conference_Location :
Penang
Print_ISBN :
978-1-4673-0021-6
Type :
conf
DOI :
10.1109/CHUSER.2011.6163754
Filename :
6163754
Link To Document :
بازگشت