• DocumentCode
    3396247
  • Title

    Effective management of hierarchical storage using two levels of data clustering

  • Author

    Orlandic, Ratko

  • Author_Institution
    Dept. of Comput. Sci., Illinois Inst. of Technol., Chicago, IL, USA
  • fYear
    2003
  • fDate
    7-10 April 2003
  • Firstpage
    270
  • Lastpage
    279
  • Abstract
    When data resides on tertiary storage, clustering is the key to achieving high retrieval performance. However, a straightforward approach to clustering massive amounts of data on this storage requires considerable computational and storage resources that usually exceed the capabilities of even the richest super-computing centers. This paper develops a new approach to hierarchical storage management in data grid environments, which calls for two levels of clustering data on tertiary storage. Applying a mix of static and dynamic decisions, this approach achieves the benefits of clustering at reasonable costs. However, an effective realization of the approach in generic data grid environments requires advances in the areas of indexing and clustering large scientific data collections on tertiary storage. The paper describes some novel indexing and clustering techniques that can cope well not only with extremely large volumes but also with very high dimensionalities of scientific data. The basic principles of a new clustering technique for large volumes of multi-dimensional data are introduced in the paper for the first time.
  • Keywords
    database indexing; scientific information systems; storage management; very large databases; data clustering; data grid environments; dynamic decisions; hierarchical storage management; high retrieval performance; indexing; large scientific data collections; static decisions; tertiary storage; Computer science; Costs; Databases; Engines; Environmental management; Indexing; Information retrieval; Linear particle accelerator; Physics; US Department of Energy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Mass Storage Systems and Technologies, 2003. (MSST 2003). Proceedings. 20th IEEE/11th NASA Goddard Conference on
  • Print_ISBN
    0-7695-1914-8
  • Type

    conf

  • DOI
    10.1109/MASS.2003.1194863
  • Filename
    1194863