Title :
Elastic data partitioning for cloud-based SQL processing systems
Author_Institution :
Univ. of Hawai´i at Manoa, Honolulu, HI, USA
Abstract :
One of the key advantages of cloud computing is the elasticity in which computing resources such as virtual machines can be increased or decreased. Current state-of-the-art shared-nothing parallel SQL processing systems, on the other hand, are often designed and optimized for a fixed number of database nodes. To take advantage of the elasticity afforded by cloud computing, cloud-based SQL processing systems need the ability to repartition the data easily when the number of database nodes is scaled up or down. In this paper, we investigate the problem of supporting elastic partitioning of data in cloud-based parallel SQL processing systems. We propose several algorithms and associated data organization techniques that minimizes the re-partitioning of tuples and the movement of data between nodes. Our experimental evaluation demonstrates the effectiveness of the proposed methods.
Keywords :
SQL; cloud computing; parallel processing; associated data organization techniques; cloud computing; cloud-based SQL processing systems; database nodes; elastic data partitioning; shared-nothing parallel SQL processing systems; virtual machines; Catalogs; Cloud computing; Database systems; Elasticity; Partitioning algorithms; Virtual machining;
Conference_Titel :
Big Data, 2013 IEEE International Conference on
Conference_Location :
Silicon Valley, CA
DOI :
10.1109/BigData.2013.6691766