DocumentCode
3301373
Title
A Request Skew Aware Heterogeneous Distributed Storage System Based on Cassandra
Author
Ye, Zhen ; Li, Shanping
Author_Institution
Dept. of Comput. Sci. & Technol., Zhejiang Univ., Hangzhou, China
fYear
2011
fDate
19-21 May 2011
Firstpage
1
Lastpage
5
Abstract
many distributed storage systems have been proposed to provide high scalability and high availability for modern web applications. However, most of those applications only aware data skew while actually request skew is also widely exist and needed to be considered as well. In this paper, we present a request skew aware heterogeneous distributed storage system based on Cassandra-a famous NoSQL database aiming to manage very large scale data without single point of failure. We improve Cassandra through two ways: 1) minimize forward request load by shifting the node where the client application connect to the one which can handle maximum number of skewed request dynamically; 2) when balancing data load among all nodes within the cluster, take their storage capacity into consideration. The results of our experiment present that we can reduce about 25% forward read request and 15% forward write request through approach 1) and balance storage utilization of each node obviously after applying 2).
Keywords
SQL; distributed processing; storage management; Cassandra; NoSQL database; data management; forward request load minimization; request skew aware heterogeneous distributed storage system; Availability; Clustering algorithms; Distributed databases; Generators; Scalability; Synchronization;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer and Management (CAMAN), 2011 International Conference on
Conference_Location
Wuhan
Print_ISBN
978-1-4244-9282-4
Type
conf
DOI
10.1109/CAMAN.2011.5778745
Filename
5778745
Link To Document