• DocumentCode
    1930557
  • Title

    A Self-Recovery Model for Distributed Applications Based on Microreboot

  • Author

    Wang, Huiqiang ; Ye, Haizhi ; Ying, Liang

  • Author_Institution
    Coll. of Comput. Sci. & Technol., Harbin Eng. Univ., Harbin
  • fYear
    2008
  • fDate
    28-29 Jan. 2008
  • Firstpage
    521
  • Lastpage
    524
  • Abstract
    Automatic and fast recovery from failure is an important way of guaranteeing high availability for distributed application systems. Building on microreboot techniques and autonomic computing ideas, this paper analyzes the key issues in realizing self-recovery for distributed applications and then presents a novel microreboot-based self-recovery model. The construction of the model is described in detail from several perspectives, such as behavior monitoring, failure management, and recovery policy, and the principles of realizing self-recovery for distributed applications are explained. The model aims to address common failures in large distributed applications and can recover effectively without human intervention.
  • Keywords
    distributed processing; fault tolerant computing; system recovery; autonomic computing; behavior monitoring; distributed application systems; failure management; microreboot techniques; recovery policy; self recovery model; Application software; Availability; Computer crashes; Computer science; Computerized monitoring; Condition monitoring; Distributed computing; Educational institutions; Humans; Internet; Autonomic Computing; Distribute Application; Microreboot; State Isolation;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Internet Computing in Science and Engineering, 2008. ICICSE '08. International Conference on
  • Conference_Location
    Harbin
  • Print_ISBN
    978-0-7695-3112-0
  • Electronic_ISBN
    978-0-7695-3112-0
  • Type

    conf

  • DOI
    10.1109/ICICSE.2008.52
  • Filename
    4548317