Title :
A Novel Scalable Architecture of Cloud Storage System for Small Files Based on P2P
Author :
Zhang Qi-fei ; Pan Xue-zeng ; Shen Yan ; Li Wen-juan
Author_Institution :
Coll. of Comput. Sci. & Technol., Zhejiang Univ., Hangzhou, China
Abstract :
Scalability and latency are two important performance indicators for a distributed file system. Google and Apache have achieved great success with GFS and HDFS when operating on large files, but latency is too high when reading and writing small files, because concurrent I/O cannot be exploited for small files. In addition, the master node is difficult to scale in a cloud storage system with a master/slave structure. In this paper, we propose a distributed cloud storage system based on P2P, in which a central routing node is introduced to improve resource query efficiency, so that clients can locate data with a single message, compared with the O(log N) messages required by Chord. The central routing node stores only the status and routing information of all data nodes, indexed by a trie structure, so query time meets the requirements of online queries. The data nodes store both file content and file metadata, so the system is easy to scale because the master node no longer needs to store the metadata. Clients can also cache the routing information, so read and write times are further reduced in accordance with the principle of locality. Experiments show that read and write times are significantly reduced compared with Hadoop HDFS.
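The routing idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the class names (RouteTable, Client), the string keys, the node addresses, and the assumption that registered key prefixes never overlap are all hypothetical and chosen only for illustration. The sketch shows a trie that maps key prefixes to data-node addresses, and a client that caches routing answers so that repeated accesses to nearby keys skip the central routing node.

```python
class TrieNode:
    """One character position in the routing trie."""
    def __init__(self):
        self.children = {}     # next character -> TrieNode
        self.node_addr = None  # data-node address if a prefix ends here


class RouteTable:
    """Hypothetical central routing node: holds routing info only, no file metadata."""
    def __init__(self):
        self.root = TrieNode()

    def register(self, prefix, node_addr):
        """Record that keys starting with `prefix` live on `node_addr`."""
        cur = self.root
        for ch in prefix:
            cur = cur.children.setdefault(ch, TrieNode())
        cur.node_addr = node_addr

    def lookup(self, key):
        """Return (matched_prefix, node_addr) for the longest registered
        prefix of `key`, or None if no registered prefix matches."""
        cur = self.root
        best = ("", cur.node_addr) if cur.node_addr is not None else None
        for i, ch in enumerate(key):
            cur = cur.children.get(ch)
            if cur is None:
                break
            if cur.node_addr is not None:
                best = (key[:i + 1], cur.node_addr)
        return best


class Client:
    """Hypothetical client that caches routing answers (locality assumption)."""
    def __init__(self, route_table):
        self.route_table = route_table
        self.cache = {}         # prefix -> data-node address
        self.route_queries = 0  # messages sent to the central routing node

    def locate(self, key):
        # Assumption: registered prefixes do not overlap, so any cached
        # prefix that matches the key gives the correct data node.
        for prefix, addr in self.cache.items():
            if key.startswith(prefix):
                return addr
        self.route_queries += 1  # one message to the routing node
        hit = self.route_table.lookup(key)
        if hit is None:
            raise KeyError(key)
        prefix, addr = hit
        self.cache[prefix] = addr
        return addr


table = RouteTable()
table.register("0", "node-a:9000")
table.register("1", "node-b:9000")
client = Client(table)
client.locate("0f3a")  # asks the routing node once
client.locate("0b77")  # answered from the client cache
```

In this sketch a lookup costs time proportional to the key length rather than to the number of data nodes, and a cache hit costs no message to the routing node at all.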
Keywords :
cloud computing; distributed databases; peer-to-peer computing; query processing; tree data structures; Apache; Chord log(N); GFS; Google; Hadoop HDFS; P2P; central route node; concurrent IO; distributed cloud storage system; distributed file system; locality principle; master-slave structure; metadata; performance indicators; resource query efficiency; scalable architecture; trie tree structure; Cloud computing; File systems; Routing; Servers; Vegetation; Chord Routing Algorithm; Cloud Storage System; DHT; P2P; Trie Tree; the Small File;
Conference_Title :
Cluster Computing Workshops (CLUSTER WORKSHOPS), 2012 IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4673-2893-7
DOI :
10.1109/ClusterW.2012.27