• DocumentCode
    3190443
  • Title

    Toward Behavioral Modeling of a Grid System: Mining the Logging and Bookkeeping Files

  • Author

    Zhang, Xiangliang ; Sebag, Michéle ; Germain, Cécile

  • Author_Institution
    Univ. Paris-Sud, Orsay
  • fYear
    2007
  • fDate
    28-31 Oct. 2007
  • Firstpage
    581
  • Lastpage
    588
  • Abstract
    Grid systems are complex heterogeneous systems, and their modeling constitutes a highly challenging goal. This paper is interested in modeling the jobs handled by the EGEE grid, by mining the Logging and Bookkeeping files. The goal is to discover meaningful job clusters, going beyond the coarse categories of "successfully terminated jobs" and "other jobs". The presented approach is a three- step process: i) Data slicing is used to alleviate the job heterogeneity and afford discriminant learning; ii) Constructive induction proceeds by learning discriminant hypotheses from each data slice; Hi) Finally, double clustering is used on the representation built by constructive induction; the clusters are fully validated after the stability criteria proposed by Meila (2006). Lastly, the job clusters are submitted to the experts and some meaningful interpretations are found.
  • Keywords
    grid computing; recording; EGEE grid; bookkeeping; complex heterogeneous systems; data slicing; grid system; logging; Clustering algorithms; Computer science; Conferences; Data mining; Europe; Humans; Machine learning; Real time systems; Stability criteria; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshops, 2007. ICDM Workshops 2007. Seventh IEEE International Conference on
  • Conference_Location
    Omaha, NE
  • Print_ISBN
    978-0-7695-3019-2
  • Electronic_ISBN
    978-0-7695-3033-8
  • Type

    conf

  • DOI
    10.1109/ICDMW.2007.52
  • Filename
    4476726