DocumentCode
1872423
Title
Supporting load balancing for distributed data-intensive applications
Author
Glimcher, Leonid ; Ravi, Vignesh T. ; Agrawal, Gagan
Author_Institution
Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
fYear
2009
fDate
16-19 Dec. 2009
Firstpage
235
Lastpage
244
Abstract
In data-intensive computing, an important problem that has received relatively little attention is of transparent processing of data stored in remote data repositories. Interesting load balancing considerations arise for these scenarios. Particularly, based on where data is generated and how it is shared, a dataset of interest can be divided across multiple data repositories, which may be geographically distributed and the data may be partitioned in a number of ways. This paper focuses on enabling such distributed processing of data from distributed resources. We have developed a load balancing algorithm, which minimizes the total time spent on processing the data. We consider weighted sum of two factors, a load balancing factor and a term that captures the amount of time spent by processing nodes waiting for the data. Our solutions have been implemented and evaluated in the context of FREERIDE-G (FRamework for Rapid Implementation of Datamining Engines in Grid). We have extensively evaluated our techniques using two data-intensive applications.
Keywords
data mining; grid computing; resource allocation; FREERIDE-G; distributed data-intensive applications; distributed processing; distributed resources; framework for rapid implementation of datamining engines in grid; load balancing; remote data repositories; transparent processing; Application software; Computer science; Data analysis; Data mining; Data processing; Distributed computing; Engines; Load management; Middleware; Partitioning algorithms;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computing (HiPC), 2009 International Conference on
Conference_Location
Kochi
Print_ISBN
978-1-4244-4922-4
Electronic_ISBN
978-1-4244-4921-7
Type
conf
DOI
10.1109/HIPC.2009.5433204
Filename
5433204
Link To Document