Title :
Handling catastrophic failures in scalable Internet applications
Author :
Haungs, Michael ; Pandey, Raju ; Barr, Earl
Author_Institution :
Dept. of Comput. Sci., California Polytech., San Luis Obispo, CA, USA
Abstract :
User perceived quality is the most important aspect of Internet applications. After a single negative experience, users tend to switch to one of the other myriad of alternatives available to them on the Internet. Two key components of Internet application quality are scalability and reliability. In this paper, we present the first general-purpose mechanism capable of maintaining reliability in the face of process, machine, and catastrophic failures. We define catastrophic failures as events that cause entire clusters of servers to become unavailable such as network partitioning, router failures, natural disasters, or even terrorist attacks. Our mechanism utilizes client-side tunneling, client-side redirection, and implicit redirection triggers to deliver reliable communication channels. We capitalize on previous work, redirectable sockets (RedSocks), that focuses on Internet application scalability. RedSocks are communication channels enhanced with a novel session layer aimed at modernizing network communication. We modify Red-Socks to create the first fault tolerant socket solution that can handle all server-side failures. Our mechanism is compatible with NATs and firewalls, scalable, application independent, and backwards compatible.
Keywords :
Internet; computer network management; computer network reliability; fault tolerant computing; system recovery; telecommunication channels; Internet application quality; Internet application reliability; Internet application scalability; RedSocks; catastrophic failure; client-side redirection; client-side tunneling; communication channels; fault tolerance; machine failure; network communication; network partitioning; process failure; redirectable sockets; router failures; server clusters; server-side failures; Communication channels; Internet; Maintenance; Network servers; Scalability; Sockets; Switches; Terrorism; Tunneling; Web server;
Conference_Titel :
Applications and the Internet, 2004. Proceedings. 2004 International Symposium on
Print_ISBN :
0-7695-2068-5
DOI :
10.1109/SAINT.2004.1266115