Title :
Impact of I/O and execution scheduling strategies on large scale parallel data mining
Author :
Benjamas, Nunnapus ; Uthayopas, Putchong
Author_Institution :
Dept. of Comput. Sci., Khonkaen Univ., Khonkaen, Thailand
Abstract :
In the era of “Big Data”, there is an emerging need to process a massive data set using large cluster system. Anyway, without the right strategies to handle the data, it is challenging to gain a good performance from the system. In this paper, many I/O and execution scheduling strategies for parallel data mining application has been investigated. The goal is to discover strategies that balance the data processing load and better utilize a multi-core cluster system for data mining application. Issues that impact the performance have been explored. The simulation results show that a substantial performance improvement can be obtained especially with a multi-core cluster system when a proper I/O and task execution sequence scheduling has been employed.
Keywords :
data handling; data mining; pattern clustering; I/O strategies; big data; data handling; data processing load; execution scheduling strategies; large cluster system; large scale parallel data mining; multicore cluster system; I/O scheduling; data mining; execution scheduling; multi-core; parallel FI-growth;
Conference_Titel :
Information Science and Service Science and Data Mining (ISSDM), 2012 6th International Conference on New Trends in
Conference_Location :
Taipei
Print_ISBN :
978-1-4673-0876-2