• DocumentCode
    2262715
  • Title

    A Novel Scalable Architecture of Cloud Storage System for Small Files Based on P2P

  • Author

    Zhang Qi-fei ; Pan Xue-zeng ; Shen Yan ; Li Wen-juan

  • Author_Institution
    Coll. of Comput. Sci. & Technol., Zhejiang Univ., Hangzhou, China
  • fYear
    2012
  • fDate
    24-28 Sept. 2012
  • Firstpage
    41
  • Lastpage
    47
  • Abstract
    Scalability and latency are two important performance indicators for a distributed file system. Google and Apache have achieved great success with GFS and HDFS for large files, but latency is too high when reading and writing small files, because concurrent I/O cannot be exploited for small files; moreover, the master node is difficult to scale in a cloud storage system with a master/slave structure. In this paper, we propose a distributed cloud storage system based on P2P in which a central routing node is introduced to improve resource query efficiency, so that clients can locate data with a single message, compared with Chord's log(N). The central routing node stores only the status and routing information of all data nodes, indexed by a trie structure, so query time meets the requirements of online queries. The data nodes store both file content and file metadata, so the system is easy to scale because the master node no longer needs to store metadata. Clients can also cache routing information, which reduces read and write time in accordance with the locality principle. Experiments show that read and write times are significantly reduced compared with Hadoop HDFS.
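    The routing scheme described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the key scheme (`key_bits`), the class names `TrieRouter` and `Client`, the binary-trie layout, and the node names are all hypothetical, and the sketch assumes that the registered prefixes partition the key space (no prefix is nested inside another), so a cached prefix can be reused safely.

```python
import hashlib


def key_bits(name: str, width: int = 16) -> str:
    """Hash a file name to a fixed-width bit string (hypothetical key scheme)."""
    digest = hashlib.sha1(name.encode("utf-8")).digest()
    value = int.from_bytes(digest[:4], "big") >> (32 - width)
    return format(value, f"0{width}b")


class TrieRouter:
    """Stand-in for the central routing node: a binary trie that maps
    key prefixes to data-node addresses. It holds no file metadata."""

    def __init__(self):
        self.root = {}

    def register(self, prefix: str, node: str) -> None:
        """Assign every key beginning with `prefix` to `node`."""
        cur = self.root
        for bit in prefix:
            cur = cur.setdefault(bit, {})
        cur["node"] = node

    def lookup(self, bits: str):
        """Return (prefix, node) for the longest registered prefix of `bits`."""
        cur = self.root
        best = ("", cur["node"]) if "node" in cur else None
        for i, bit in enumerate(bits):
            cur = cur.get(bit)
            if cur is None:
                break
            if "node" in cur:
                best = (bits[: i + 1], cur["node"])
        if best is None:
            raise KeyError(bits)
        return best


class Client:
    """Client that caches routing entries so repeated lookups skip the router."""

    def __init__(self, router: TrieRouter):
        self.router = router
        self.cache = {}          # prefix -> data-node address
        self.router_queries = 0  # number of messages sent to the router

    def locate(self, name: str) -> str:
        bits = key_bits(name)
        # Cache hit: no message to the routing node is needed.
        for prefix, node in self.cache.items():
            if bits.startswith(prefix):
                return node
        # Cache miss: one message to the central routing node.
        self.router_queries += 1
        prefix, node = self.router.lookup(bits)
        self.cache[prefix] = node
        return node
```

    In this sketch a cache miss costs one router query and later lookups that fall under the same prefix cost none, which is the effect the abstract attributes to client-side caching of routing information.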
  • Keywords
    cloud computing; distributed databases; peer-to-peer computing; query processing; tree data structures; Apache; Chord log(N); GFS; Google; Hadoop HDFS; P2P; central route node; concurrent IO; distributed cloud storage system; distributed file system; locality principle; master-slave structure; metadata; performance indicators; resource query efficiency; scalable architecture; trie tree structure; Cloud computing; File systems; Routing; Servers; Vegetation; Chord Routing Algorithm; Cloud Storage System; DHT; P2P; Trie Tree; the Small File;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Cluster Computing Workshops (CLUSTER WORKSHOPS), 2012 IEEE International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4673-2893-7
  • Type

    conf

  • DOI
    10.1109/ClusterW.2012.27
  • Filename
    6355845