DocumentCode
3463175
Title
Research on Data Interoperability Based on Clustering Analysis in Data Grid
Author
Liu, Gui ; Wei, HaiLiang ; Wang, Xin ; Peng, Wei
Author_Institution
Inst. of Inf. Eng., Inf. Eng. Univ., Zhengzhou, China
fYear
2009
fDate
21-22 April 2009
Firstpage
97
Lastpage
103
Abstract
In data grid, it is an important research filed to complete interoperability of data. In the mean time, share of data also becomes the crucial problem. Data replication, as a solved solution of data share, goes into more and more vital. A strategy called replication strategy based on clustering analysis (RSCA) is proposed, which confirms the correlation among the data files accessed according to the access history of users. And then, through clustering analysis operation obtains the correlative files sets related to the access habits of users. At the same time, it produces the data files replica on the basis of those sets, which achieves the aim of prefetching and buffering data. The experimental results show that RSCA is effective and available. Contrast to other dynamic replication strategies, it has reduced not only the average response time of client nodes, but also those of the bandwidth consumption.
Keywords
data handling; grid computing; pattern clustering; statistical analysis; clustering analysis; data grid; data interoperability; data replication strategy; Application software; Bandwidth; Data analysis; Data engineering; Delay; Grid computing; History; Information analysis; Load management; Prefetching; RSCA; correlative files sets; correlative relation;
fLanguage
English
Publisher
ieee
Conference_Titel
Interoperability for Enterprise Software and Applications China, 2009. IESA '09. International Conference on
Conference_Location
Beijing
Print_ISBN
978-0-7695-3652-1
Type
conf
DOI
10.1109/I-ESA.2009.19
Filename
5260857
Link To Document