• DocumentCode
    571613
  • Title

    A High-Performance Cluster Management System Based on Distributed Hierarchical Autonomic Management Mechanism

  • Author

    Wang, Jie ; Zeng, Yu

  • Author_Institution
    Sch. of Manage., Capital Normal Univ., Beijing, China
  • Volume
    1
  • fYear
    2012
  • fDate
    26-27 Aug. 2012
  • Firstpage
    297
  • Lastpage
    300
  • Abstract
    Large size cluster management is a complex and difficult task. In this paper, we firstly discuss distributed hierarchical autonomic management mechanisms including the framework of distributed hierarchical autonomic management system and functions of each its component. And then we design and realize a high-performance cluster management system DHAView. It has autonomic management features such as global information integration, global unified monitoring and management, alarm correlation inference base on autonomic element and local event association analysis. Now this DHAView system is successfully used to manage a real large size high performance cluster.
  • Keywords
    correlation theory; distributed processing; pattern clustering; DHAView system; autonomic management features; distributed hierarchical autonomic management system; real large size high performance cluster management system; Computational modeling; Computer architecture; Computers; Correlation; Monitoring; Reliability; Scalability; autonomic computing; distributed hierarchical autonomic management; high-performance cluster management;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Human-Machine Systems and Cybernetics (IHMSC), 2012 4th International Conference on
  • Conference_Location
    Nanchang, Jiangxi
  • Print_ISBN
    978-1-4673-1902-7
  • Type

    conf

  • DOI
    10.1109/IHMSC.2012.81
  • Filename
    6305685