Title :
A fast map-reduce algorithm for burst errors in big data cloud storage
Author :
Xue Qin ; Kelley, Brian ; Saedy, Mahdy
Author_Institution :
Dept. of ECE, Univ. of Texas at San Antonio, San Antonio, TX, USA
Abstract :
In distributed storage for Big Data systems, there is a need for exact repair, high bandwidth codes. The challenge for exact repair in big-data storage is to simultaneously enable both very high bandwidth repair using Map-Reduce and simple coding schemes that also combine robust maximally distance separable (MDS) exact repair. MDS repair is for the rare, but exceptional outlier error patterns requiring optimum erasure code reconstruction. We construct the optimum fast bandwidth repair for big-data sources. Our system uses Map-Reduce, exact repair reconstruction. The algorithm combines MDS with a second fast decode algorithm in a cloud environment. We illustrate cloud experiments for optimum fast bandwidth reconstruction for 1-Exabyte Big Data in the cloud and demonstrate cloud results for Poisson error rate arrival models. Unlike prior methods, we jointly solve the problem of fast bandwidth repair for burst-memory error patterns and for code rates up to - in a real time error model framework for Big Data. Furthermore, simulations indicate this method outperforms prior fast bandwidth approaches for burst errors. We also illustrate Map-Reduce algorithm optimized for fast bandwidth repair in Big Data storage in clouds.
Keywords :
Big Data; cloud computing; parallel processing; stochastic processes; storage management; 1-Exabyte Big Data; Big Data systems; Big data cloud storage; Big-Data sources; MDS repair; MapReduce algorithm; Poisson error rate arrival models; bandwidth codes; burst-memory error patterns; cloud environment; coding schemes; distributed storage; exact repair reconstruction; optimum erasure code reconstruction; optimum fast bandwidth repair; real time error model framework; robust maximally distance separable exact repair; Bandwidth; Big data; Distributed databases; Encoding; Image reconstruction; Maintenance engineering; Systems engineering and theory; Exabyte Big Data; Map-Reduce; cloud systems; erasure codes; exact repair; fast reconstruction; maximally distance separable;
Conference_Titel :
System of Systems Engineering Conference (SoSE), 2015 10th
Conference_Location :
San Antonio, TX
DOI :
10.1109/SYSOSE.2015.7151912