• DocumentCode
    3301373
  • Title

    A Request Skew Aware Heterogeneous Distributed Storage System Based on Cassandra

  • Author

    Ye, Zhen ; Li, Shanping

  • Author_Institution
    Dept. of Comput. Sci. & Technol., Zhejiang Univ., Hangzhou, China
  • fYear
    2011
  • fDate
    19-21 May 2011
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    many distributed storage systems have been proposed to provide high scalability and high availability for modern web applications. However, most of those applications only aware data skew while actually request skew is also widely exist and needed to be considered as well. In this paper, we present a request skew aware heterogeneous distributed storage system based on Cassandra-a famous NoSQL database aiming to manage very large scale data without single point of failure. We improve Cassandra through two ways: 1) minimize forward request load by shifting the node where the client application connect to the one which can handle maximum number of skewed request dynamically; 2) when balancing data load among all nodes within the cluster, take their storage capacity into consideration. The results of our experiment present that we can reduce about 25% forward read request and 15% forward write request through approach 1) and balance storage utilization of each node obviously after applying 2).
  • Keywords
    SQL; distributed processing; storage management; Cassandra; NoSQL database; data management; forward request load minimization; request skew aware heterogeneous distributed storage system; Availability; Clustering algorithms; Distributed databases; Generators; Scalability; Synchronization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer and Management (CAMAN), 2011 International Conference on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-1-4244-9282-4
  • Type

    conf

  • DOI
    10.1109/CAMAN.2011.5778745
  • Filename
    5778745