• DocumentCode
    3717320
  • Title

    Account clustering in multi-tenant storage management environments

  • Author

    Gabor Madl;Ramani Routray;Yang Song;Rakesh Jain

  • Author_Institution
    Cloud Systems Analytics, IBM Research, Almaden, San Jose, CA 95120
  • fYear
    2015
  • Firstpage
    1698
  • Lastpage
    1707
  • Abstract
    Multi-tenant storage management environments typically manage multiple enterprise accounts with heterogeneous storage footprints. Identifying and grouping accounts with similar storage footprints into clusters reduces account management overhead, and provides a framework for data-driven storage recommendation services. This paper describes a method for the clustering of accounts in multi-tenant storage management environments. Storage system vendors, models, and footprints are captured as a set of properties, and a pairwise distance function is employed to grade similarity between accounts along multiple dimensions. A graph representation is created based on account similarity values. Finally, the clustering algorithm is defined as a graph algorithm with the purpose of repeatedly finding maximum cliques, and removing them from the graph. The result of the graph algorithm is a set of clusters, each grouping together accounts with very similar storage footprints. Clusters are then rated along multiple metrics, are compared to their peers along multiple performance dimensions, and receive recommendations on how to further improve storage efficiency.
  • Keywords
    "Storage management","Clustering algorithms","Measurement","Optimization","Big data","Standards","Graph theory"
  • Publisher
    ieee
  • Conference_Titel
    Big Data (Big Data), 2015 IEEE International Conference on
  • Type

    conf

  • DOI
    10.1109/BigData.2015.7363941
  • Filename
    7363941