DocumentCode :
170545
Title :
When queueing meets coding: Optimal-latency data retrieving scheme in storage clouds
Author :
Shengbo Chen ; Yin Sun ; Kozat, Ulas C. ; Longbo Huang ; Sinha, Pradeep ; Guanfeng Liang ; Xin Liu ; Shroff, Ness B.
Author_Institution :
Dept. of ECE, Ohio State Univ., Columbus, OH, USA
fYear :
2014
fDate :
April 27 2014-May 2 2014
Firstpage :
1042
Lastpage :
1050
Abstract :
Storage clouds, such as Amazon S3, are being widely used for web services and Internet applications. It has been observed that the delay for retrieving data from and placing data into the clouds is quite random, and exhibits weak correlations between different read/write requests. This inspires us to investigate a key problem: can we reduce the delay by transmitting data replications in parallel or using powerful erasure codes? In this paper, we study the problem of reducing the delay of downloading data from cloud storage systems by leveraging multiple parallel threads, assuming that the data has been encoded and stored in the clouds using fixed rate forward error correction (FEC) codes with parameters (n, k). That is., each file is divided into k equal-sized chunks, which are then expanded into n chunks such that any k chunks out of the n are sufficient to successfully restore the original file. The model can be depicted as a multiple-server queue with arrivals of data retrieving requests and a server corresponding to a thread. However, this is not a typical queueing model because a server can terminate its operation, depending on when other servers complete their service (due to the redundancy that is spread across the threads). Hence, to the best of our knowledge, the analysis of this queueing model remains quite uncharted. Real traces from Amazon S3 show that the time to retrieve a fixed size chunk is random and can be accurately approximated as an i.i.d. exponentially distributed random variable. We show that any work-conserving scheme is delay-optimal when k = 1. When k > 1, we find that a simple greedy scheme, which allocates all available threads to the head of line request, is delay optimal, which appears surprising.
Keywords :
Web services; cloud computing; forward error correction; storage management; Amazon S3; FEC codes; Internet applications; Web services; cloud storage systems; data retrieving requests; downloading data; exponentially distributed random variable; forward error correction codes; greedy scheme; multiple parallel threads; multiple-server queue; optimal-latency data retrieving scheme; parallel data replication transmission; powerful erasure codes; queueing model; read/write requests; work-conserving scheme; Cloud computing; Delays; Encoding; Instruction sets; Redundancy; Resource management; Servers;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
INFOCOM, 2014 Proceedings IEEE
Conference_Location :
Toronto, ON
Type :
conf
DOI :
10.1109/INFOCOM.2014.6848034
Filename :
6848034
Link To Document :
بازگشت