DocumentCode :
3781778
Title :
Optimizing Data Partition for NoSQL Cluster
Author :
Xiangdong Huang;Jianmin Wang;Yu Zhong;Philip S. Yu
Author_Institution :
Sch. of Software, Tsinghua Univ., Beijing, China
fYear :
2015
Firstpage :
962
Lastpage :
969
Abstract :
The balance of data partitions significantly affects the performance of NoSQL systems. Most P2P NoSQL systems use consistent hashing to partition data automatically. Currently, these systems divide the consistent hashing ring using random virtual nodes or manual configuration, which may cause load imbalance and degrade performance. The problem is especially pronounced for heterogeneous clusters. In this paper, we focus on the partition strategy of the consistent hashing ring and propose a quantified criterion for data partitioning. When initializing a cluster, we convert the problem into an optimization problem to find the most even partitioning result. Experiments on Cassandra and Voldemort show that these methods outperform the current implementations. Moreover, the algorithms are very efficient, even for heterogeneous clusters.
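The imbalance the abstract describes can be illustrated with a minimal sketch. It is not the paper's algorithm or its quantified criterion. The ring size, the function names, and the max-to-ideal-share metric are all assumptions made for illustration. The sketch compares randomly placed virtual-node tokens with tokens placed so that each node's arc is proportional to a hypothetical capacity weight.

```python
import random

RING_SIZE = 2**32  # assumed ring size, chosen only for illustration


def ownership_shares(tokens_by_node, ring_size=RING_SIZE):
    """Fraction of the ring owned by each node.

    Each token owns the arc from the previous token (exclusive) up to
    itself (inclusive), wrapping around. Assumes at least two distinct tokens.
    """
    ring = sorted((t, n) for n, tokens in tokens_by_node.items() for t in tokens)
    owned = {n: 0 for n in tokens_by_node}
    for i, (token, node) in enumerate(ring):
        prev = ring[i - 1][0]  # i == 0 wraps to the last token
        owned[node] += (token - prev) % ring_size
    return {n: arc / ring_size for n, arc in owned.items()}


def imbalance(shares, weights):
    """Assumed metric: largest ratio of actual share to capacity-ideal share."""
    total = sum(weights.values())
    return max(shares[n] / (weights[n] / total) for n in shares)


def random_tokens(nodes, vnodes, rng, ring_size=RING_SIZE):
    """Give every node `vnodes` distinct random tokens (random virtual nodes)."""
    picks = rng.sample(range(ring_size), len(nodes) * vnodes)
    return {n: picks[i * vnodes:(i + 1) * vnodes] for i, n in enumerate(nodes)}


def proportional_tokens(weights, ring_size=RING_SIZE):
    """Give every node one token so that its arc is proportional to its weight."""
    total = sum(weights.values())
    tokens, acc = {}, 0
    for node, w in weights.items():
        acc += w
        tokens[node] = [(acc * ring_size // total) % ring_size]
    return tokens


if __name__ == "__main__":
    weights = {"a": 1, "b": 1, "c": 2}  # hypothetical heterogeneous capacities
    rng = random.Random(0)
    rand = ownership_shares(random_tokens(list(weights), 8, rng))
    prop = ownership_shares(proportional_tokens(weights))
    print("random tokens imbalance:      ", imbalance(rand, weights))
    print("proportional tokens imbalance:", imbalance(prop, weights))
```

A ratio of 1.0 means every node holds exactly its capacity-proportional share, and larger values mean at least one node is overloaded. The proportional placement here is only a baseline for comparison. The paper instead formulates token placement as an optimization problem.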
Keywords :
"Partitioning algorithms","Clustering algorithms","Optimization","Servers","Scalability","Software","Manuals"
Publisher :
IEEE
Conference_Title :
Ubiquitous Intelligence and Computing and 2015 IEEE 12th Intl Conf on Autonomic and Trusted Computing and 2015 IEEE 15th Intl Conf on Scalable Computing and Communications and Its Associated Workshops (UIC-ATC-ScalCom), 2015 IEEE 12th Intl Conf on
Type :
conf
DOI :
10.1109/UIC-ATC-ScalCom-CBDCom-IoP.2015.182
Filename :
7518361