Title :
A Data Mapping Strategy of Parallel Data Mining in Grid
Author :
Yonghong, Luo ; Tefang, Chen
Author_Institution :
Sch. of Traffic & Transp. Eng., Central South Univ., Changsha, China
Abstract :
To achieve data load balancing across mining nodes in grid-based parallel data mining, this paper proposes MapND, a data mapping strategy based on minimum time cost. First, MapND divides the data sets referenced by the data mining task into data blocks. Then, for each pairing of a mining node with a data block, it estimates the sum of the transmission time and the data mining time, and maps each data block to the mining node with the minimum time sum. Experimental results show that the proposed MapND strategy can solve the problem of data load balancing for data mining nodes and improve the performance of parallel data mining in grid.
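The mapping step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the `Node` fields, the linear cost model (block size divided by bandwidth, plus block size divided by mining rate), and the choice to add each node's already-queued work to its estimate are all assumptions made for illustration only.

```python
from dataclasses import dataclass


@dataclass
class Node:
    """Hypothetical mining node; all fields are illustrative assumptions."""
    name: str
    bandwidth_mb_s: float      # assumed transfer speed to this node
    mining_rate_mb_s: float    # assumed mining throughput of this node
    busy_until_s: float = 0.0  # estimated time already queued on this node


def split_into_blocks(total_mb, block_mb):
    """Divide a data set of total_mb into blocks of at most block_mb."""
    blocks = []
    remaining = total_mb
    while remaining > 0:
        size = min(block_mb, remaining)
        blocks.append(size)
        remaining -= size
    return blocks


def estimated_cost(node, block_mb):
    """Estimated transmission time plus mining time for one block."""
    return block_mb / node.bandwidth_mb_s + block_mb / node.mining_rate_mb_s


def map_blocks(blocks, nodes):
    """Assign each block to the node with the minimum estimated time sum.

    Assumption: the estimate includes the work already queued on a node,
    so that later blocks spread out instead of all going to one node.
    """
    mapping = []
    for index, block_mb in enumerate(blocks):
        best = min(
            nodes,
            key=lambda n: n.busy_until_s + estimated_cost(n, block_mb),
        )
        best.busy_until_s += estimated_cost(best, block_mb)
        mapping.append((index, best.name))
    return mapping


if __name__ == "__main__":
    nodes = [Node("A", 10.0, 10.0), Node("B", 5.0, 10.0)]
    blocks = split_into_blocks(30.0, 10.0)
    for index, name in map_blocks(blocks, nodes):
        print(f"block {index} -> node {name}")
```

In this sketch the faster node receives more blocks, while the slower node still takes work once the faster node's queue makes it the more expensive choice.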
Keywords :
data mining; grid computing; parallel processing; resource allocation; MapND; data load balancing; data mapping strategy; mining node; parallel data mining; Data communication; Data mining; Data models; Data warehouses; Distributed databases; Real time systems; Throughput; Data Mapping; Grid; MapND Strategy; Parallel Data Mining;
Conference_Title :
E-Business and E-Government (ICEE), 2010 International Conference on
Conference_Location :
Guangzhou
Print_ISBN :
978-0-7695-3997-3
DOI :
10.1109/ICEE.2010.350