• DocumentCode
    451122
  • Title

    Querying Very Large Multi-dimensional Datasets in ADR

  • Author

    Kurc, T. ; Chialin Chang ; Ferreira, R. ; Sussman, A. ; Saltz, J.

  • Author_Institution
    University of Maryland
  • fYear
    1999
  • fDate
    13-18 Nov. 1999
  • Firstpage
    12
  • Lastpage
    12
  • Abstract
    Applications that make use of very large scientific datasets have become an increasingly important subset of scientific applications. In these applications, datasets are often multi-dimensional, i.e., data items are associated with points in a multi-dimensional attribute space, and access to data items is described by range queries. The basic processing involves mapping input data items to output data items, and some form of aggregation of all the input data items that project to the each output data item. We have developed an infrastructure, called the Active Data Repository (ADR), that integrates storage, retrieval and processing of multi-dimensional datasets on distributed-memory parallel architectures with multiple disks attached to each node. In this paper we address efficient execution of range queries on distributed memory parallel machines within ADR framework. We present three potential strategies, and evaluate them under different application scenarios and machine configurations. We present experimental results on the scalability and performance of the strategies on a 128-node IBM SP.
  • Keywords
    Application software; Biomedical imaging; Chemical sensors; Computer science; Educational institutions; Information retrieval; Parallel architectures; Pathology; Satellites; Sensor phenomena and characterization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Supercomputing, ACM/IEEE 1999 Conference
  • Print_ISBN
    1-58113-091-0
  • Type

    conf

  • DOI
    10.1109/SC.1999.10046
  • Filename
    1592655