DocumentCode
2262715
Title
A Novel Scalable Architecture of Cloud Storage System for Small Files Based on P2P
Author
Zhang Qi-fei ; Pan Xue-zeng ; Shen Yan ; Li Wen-juan
Author_Institution
Coll. of Comput. Sci. & Technol., Zhejiang Univ., Hangzhou, China
fYear
2012
fDate
24-28 Sept. 2012
Firstpage
41
Lastpage
47
Abstract
Scalability and Latency are the two important performance indicators for the distributed file system, and Google and Apache have achieved a great success with GFS and HDFS when operating big files, but the latency is too long when reading and writing small-size files, because the concurrent I/O can´t work for small files, besides the master node is difficult to extend in the cloud storage system with Master/Slave structure. In this paper, we propose a distributed cloud storage system based on P2P, where a central route node is introduced to improve the resource query efficiency, so clients can find data using only one message compared with Chord´s log(N). The central routing node only stores the status and routing information of all data nodes, which are indexed by the Trie Tree structure, so query time meets the requirement of online query. The data nodes store file´s content and file´s metadata thus the system is easy to extend because the master node no longer needs to store the metadata. Clients can also cache the routing information, so the read and write time is reduced according to the Locality Principle. Experiments show that the reading and writing time is significantly reduced compared with Hadoop HDFS.
Keywords
cloud computing; distributed databases; peer-to-peer computing; query processing; tree data structures; Apache; Chord log(N); GFS; Google; Hadoop HDFS; P2P; central route node; concurrent IO; distributed cloud storage system; distributed file system; locality principle; master-slave structure; metadata; performance indicators; resource query efficiency; scalable architecture; trie tree structure; Cloud computing; File systems; Routing; Servers; Vegetation; Chord Routing Algorithm; Cloud Storage System; DHT; P2P; Trie Tree; the Small File;
fLanguage
English
Publisher
ieee
Conference_Titel
Cluster Computing Workshops (CLUSTER WORKSHOPS), 2012 IEEE International Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4673-2893-7
Type
conf
DOI
10.1109/ClusterW.2012.27
Filename
6355845
Link To Document