• DocumentCode
    1918370
  • Title

    Abstract: Using Provenance to Visualize Data from Large-Scale Experiments

  • Author

    Horta, Felipe ; Dias, Joana ; Ocana, Kary A. C. S. ; de Oliveira, Daniel ; Ogasawara, Eduardo ; Mattoso, Marta

  • fYear
    2012
  • fDate
    10-16 Nov. 2012
  • Firstpage
    1418
  • Lastpage
    1419
  • Abstract
    Large-scale scientific computations are often organized as a composition of many computational tasks linked through data flow. The data that flows through this many-task computation often moves from a desktop to a high-performance environment and then to a visualization environment. Keeping track of this data flow is a challenge for provenance support in high-performance Scientific Workflow Management Systems. After the completion of a computational scientific experiment, a scientist has to manually select and analyze its staged-out data, for instance, by checking the inputs and outputs of the computational tasks that were part of the experiment. In this paper, we present a provenance management system that describes the production and consumption relationships between data artifacts, such as files, and the computational tasks that compose the experiment. We propose a query interface that allows scientists to browse provenance data and select the output they want to visualize using browsers or a high-resolution tiled display.
  • Keywords
    data flow analysis; data visualisation; multiprogramming; query processing; scientific information systems; user interfaces; workflow management software; computational scientific experiment; computational tasks; data artifacts; data flow; data visualization environment; high-performance environment; high-performance scientific workflow management systems; high-resolution tiled display; large-scale experiments; large-scale scientific computations; many-task computing; production-consumption relationships; provenance management system; query interface; staged-out data analysis; cluster; hpc; large-scale; provenance; scientific workflow; visualization;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    High Performance Computing, Networking, Storage and Analysis (SCC), 2012 SC Companion
  • Conference_Location
    Salt Lake City, UT
  • Print_ISBN
    978-1-4673-6218-4
  • Type

    conf

  • DOI
    10.1109/SC.Companion.2012.228
  • Filename
    6496011