DocumentCode :
2418936
Title :
A Data Mapping Strategy of Parallel Data Mining in Grid
Author :
Yonghong, Luo ; Tefang, Chen
Author_Institution :
Sch. of Traffic & Transp. Eng., Central South Univ., Changsha, China
fYear :
2010
fDate :
7-9 May 2010
Firstpage :
1377
Lastpage :
1382
Abstract :
To balance the data load across mining nodes in grid-based parallel data mining, this paper proposes MapND, a data mapping strategy based on minimum time cost. MapND first divides the data sets referenced by the mining task into data blocks. It then estimates, for each pairing of mining node and data block, the sum of the transmission time and the mining time, and maps each data block to the mining node with the minimum time sum. Experimental results show that MapND balances the data load across mining nodes and improves the performance of parallel data mining in the grid.
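The mapping step described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not give the cost model, so the `Node` fields, the `estimate_cost` formula (block size divided by bandwidth, plus block size divided by mining rate), and the `busy_until` term that accounts for work already assigned to a node are all assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass
class Node:
    """A mining node with hypothetical throughput figures (assumed, not from the paper)."""
    name: str
    bandwidth: float      # assumed transfer rate to this node, MB/s
    mining_rate: float    # assumed mining throughput of this node, MB/s
    busy_until: float = 0.0  # estimated time already committed to earlier blocks


def estimate_cost(node: Node, block_size: float) -> float:
    """Estimated transmission time plus mining time for one block on one node."""
    return block_size / node.bandwidth + block_size / node.mining_rate


def map_blocks(block_sizes: list[float], nodes: list[Node]) -> dict[int, str]:
    """Assign each block to the node with the smallest estimated time sum.

    Adding busy_until to the estimate is an assumption of this sketch; it keeps
    one fast node from receiving every block.
    """
    mapping: dict[int, str] = {}
    for block_id, size in enumerate(block_sizes):
        best = min(nodes, key=lambda n: n.busy_until + estimate_cost(n, size))
        best.busy_until += estimate_cost(best, size)
        mapping[block_id] = best.name
    return mapping


if __name__ == "__main__":
    nodes = [Node("fast", bandwidth=100, mining_rate=50),
             Node("slow", bandwidth=50, mining_rate=20)]
    print(map_blocks([100, 100, 100], nodes))
```

With these assumed figures, the first two blocks go to the faster node and the third goes to the slower one, because by then the faster node's queued work outweighs its speed advantage.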
Keywords :
data mining; grid computing; parallel processing; resource allocation; MapND; data load balancing; data mapping strategy; mining node; parallel data mining; Data communication; Data models; Data warehouses; Distributed databases; Real time systems; Throughput; Data Mapping; Grid; MapND Strategy
fLanguage :
English
Publisher :
IEEE
Conference_Title :
E-Business and E-Government (ICEE), 2010 International Conference on
Conference_Location :
Guangzhou
Print_ISBN :
978-0-7695-3997-3
Type :
conf
DOI :
10.1109/ICEE.2010.350
Filename :
5591795