• DocumentCode
    2891556
  • Title

    Automated Storage Tiering Using Markov Chain Correlation Based Clustering

  • Author

    Alshawabkeh, M. ; Riska, Alma ; Sahin, Alphan ; Awwad, M.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Northeastern Univ., Boston, MA, USA
  • Volume
    1
  • fYear
    2012
  • fDate
    12-15 Dec. 2012
  • Firstpage
    392
  • Lastpage
    397
  • Abstract
    In this paper, we develop an automated and adaptive framework that aims to move active data to high performance storage tiers and inactive data to low cost/high capacity storage tiers by learning patterns of the storage workloads. The framework proposed is designed using efficient Markov chain correlation based clustering method (MCC), which can quickly predict or detect any changes in the current workload based on what the system has experienced before. The workload data is first normalized and Markov chains are constructed from the dynamics of the IO loads of the data storage units. Based on the correlation of one-step Markov chain transition probabilities k-means method is employed to group the storage units that have similar behavior at each point. Such framework can then easily be incorporated in various resource management policies that aim at enhancing performance, reliability, availability. The predictive nature of the model, particularly makes a storage system both faster and lower-cost at the same time, because it only uses high performance tiers when needed, and uses low cost/high capacity tiers when possible.
  • Keywords
    Markov processes; correlation theory; pattern clustering; probability; resource allocation; storage management; IO loads; Markov chain correlation based clustering method; automated storage tiering; data storage units; high performance storage tiers; k-means method; low cost-high capacity storage tiers; one-step Markov chain transition probabilities; pattern learning; resource management policies; storage workloads; Clustering methods; Correlation; Markov processes; Monitoring; Performance evaluation; Predictive models; IO workloads; clustering; markov chain; storage tiering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Applications (ICMLA), 2012 11th International Conference on
  • Conference_Location
    Boca Raton, FL
  • Print_ISBN
    978-1-4673-4651-1
  • Type

    conf

  • DOI
    10.1109/ICMLA.2012.71
  • Filename
    6406694