Title :
A Fault-Tolerant Middleware Architecture for High-Availability Storage Services
Author :
Seshadri, Sangeetha ; Liu, Ling ; Cooper, Brian F. ; Chiu, Lawrence ; Gupta, Karan ; Muench, Paul
Author_Institution :
Georgia Inst. of Technol., Atlanta
Abstract :
Today, organizations and business enterprises of all sizes must deal with unprecedented amounts of digital information, creating challenging demands for mass storage and on-demand storage services. Current clustered scale-out storage systems use clustering middleware based on symmetric active replication to provide continuous availability and high throughput. Such architectures provide significant gains in the cost, scalability, and performance of mass storage and storage services. However, a fundamental limitation of such an architecture is its vulnerability to application-induced massive dependent failures of the clustering middleware. In this paper, we propose hierarchical middleware architectures that improve availability and reliability in scale-out storage systems while continuing to deliver the cost and performance advantages and a single system image (SSI). Hierarchical middleware architectures organize critical cluster management services into an overlay network that provides application fault isolation and eliminates symmetric clustering middleware as a single point of failure. We present an in-depth evaluation of hierarchical middleware architectures based on an industry-strength storage system. Our results show that hierarchical architectures can significantly improve the availability and reliability of scale-out storage clusters.
Keywords :
middleware; software fault tolerance; storage management; workstation clusters; application-induced massive dependent failures; cluster management services; clustered scale-out storage systems; clustering middleware; digital information; fault-tolerant middleware architecture; hierarchical middleware architecture; high-availability storage services; mass storage services; on-demand storage services; scale-out storage systems reliability; single system image; symmetric active replication; Availability; Computer architecture; Costs; Fault tolerance; Hardware; Image storage; Middleware; Performance gain; Scalability; Throughput
Conference_Title :
Services Computing, 2007. SCC 2007. IEEE International Conference on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
0-7695-2925-9