• DocumentCode
    2136351
  • Title

    FPG-Grow: A Graph Based Pattern Grow Algorithm for Application Level IO Pattern Mining

  • Author

    Zhang Jing-Liang ; Zhang Jun-wei ; Xu Lu

  • Author_Institution
    Inst. of Comput. Technol., Chinese Acad. of Sci., Beijing, China
  • fYear
    2010
  • fDate
    15-17 July 2010
  • Firstpage
    311
  • Lastpage
    316
  • Abstract
    The previous study of pattern discovery in storage systems focus on sequential pattern (SP) mining in lower level traces, but they don´t scale well to the application level. For patterns in application level are mostly composed of Contiguous Item Sequential Patterns (CISP) which are much simpler than SP, so it´s inefficient for the previous studies to mine CISP with clumsy SP mining algorithms. We propose a novel algorithm FPG-Grow which is more preferable for mining application level IO patterns. The FPG-Grow only scan the origin sequence in one-pass to construct a Frequent Pattern Graph (FPG), from which we can easily extract the CISPs by fetching the frequent sub-graphs with linear cost. Also we can do the verification efficiently by avoiding the origin sequence scan. Furthermore, the grow method will eliminate the information loss introduced by sequence cutting as C-Miner does. The experiment result shows that the FPG-Grow outperforms C-Miner prominently in mining with real application IO traces and the simulation result also proves the effectiveness of CISP in application IO optimizations.
  • Keywords
    data mining; graph theory; optimisation; C-Miner; FPG-Grow; IO optimizations; contiguous item sequential patterns; frequent pattern graph; graph based pattern grow algorithm; level IO pattern mining; pattern discovery; storage systems; Algorithm design and analysis; Computers; Correlation; Data mining; Optimization; Search problems; Testing; C-Miner; CISP; Clospan; FPG; KCL(Kirchhoff´s Current Law); SP;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Networking, Architecture and Storage (NAS), 2010 IEEE Fifth International Conference on
  • Conference_Location
    Macau
  • Print_ISBN
    978-1-4244-8133-0
  • Type

    conf

  • DOI
    10.1109/NAS.2010.23
  • Filename
    5575676