DocumentCode
251237
Title
Application of association rule mining for replication in scientific data grid
Author
Zulkar Nine, Md S. Q. ; Azad, Md Abul Kalam ; Abdullah, Saad ; Monil, Mohammad Alaul Haque ; Zahan, Ibna ; Bin Kader, Abdulla ; Rahman, Rashedur M.
Author_Institution
Dept. of Electr. & Comput. Eng., North South Univ., Dhaka, Bangladesh
fYear
2014
fDate
20-22 Dec. 2014
Firstpage
345
Lastpage
348
Abstract
Grid computing is the most popular infrastructure in many emerging field of science and engineering where extensive data driven experiments are conducted by thousands of scientists all over the world. Efficient transfer and replication of these peta-byte scale data sets are the fundamental challenges in Scientific Grid. Data grid technology is developed to permit data sharing across many organizations in geographically disperse locations. Replication of data helps thousands of researchers all over the world to access those data sets more efficiently. Data replication is essential to ensure data reliability and availability across the grid. Replication ensures above mentioned criteria by creating more copies of same data sets across the grid. In this paper, we proposed a data mining based replication to accelerate the data access time. Our proposed algorithm mines the hidden rules of association among different files for replica optimization which proves highly efficient for different access patterns. The algorithm is simulated using data grid simulator, OptorSim, developed by European Data Grid project. Then our algorithm is compared with the existing approaches where it outperforms others.
Keywords
data mining; grid computing; replicated databases; scientific information systems; European data grid project; OptorSim; access pattern; association rule mining; data access time; data availability; data grid simulator; data grid technology; data reliability; data replication; data sharing; geographically disperse location; grid computing; peta-byte scale data set; replica optimization; scientific data grid; Algorithm design and analysis; Availability; Bandwidth; Computational modeling; Data mining; Data models; Optimization; Association rule mining; Dynamic Replication; Replica Optimization; Scientific Data Grid;
fLanguage
English
Publisher
ieee
Conference_Titel
Electrical and Computer Engineering (ICECE), 2014 International Conference on
Conference_Location
Dhaka
Print_ISBN
978-1-4799-4167-4
Type
conf
DOI
10.1109/ICECE.2014.7026895
Filename
7026895
Link To Document