• DocumentCode
    2295704
  • Title

    End-to-End Scientific Data Management Using Workflows

  • Author

    Simmhan, Yogesh

  • Author_Institution
    Microsoft Res., San Francisco, CA
  • fYear
    2008
  • fDate
    6-11 July 2008
  • Firstpage
    472
  • Lastpage
    473
  • Abstract
    Workflows have evolved as the natural tool for scientists to model their eScience experiments. With the scientific world producing data at an explosive rate, workflows have an important part to play in the end to end management of scientific data. To illustrate, workflow can help with fault tolerance and ease of administration when ingesting massive quantities of data using commodity hardware. The ability for workflows to automatically collect provenance on derived scientific data improves data discovery and publication capabilities. With better support for interoperating with data centric tools, workflows can become ubiquitous systems for scientific collaboration.
  • Keywords
    data handling; software fault tolerance; commodity hardware; end-to-end scientific data management; fault tolerance; scientific collaboration; ubiquitous systems; workflows; Collaborative tools; Collaborative work; Data analysis; Distributed databases; Explosives; Fault tolerance; Hardware; Instruments; Merging; Sensor systems; data flow; distributed data management; eScience; workflow;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Services - Part I, 2008. IEEE Congress on
  • Conference_Location
    Honolulu, HI
  • Print_ISBN
    978-0-7695-3286-8
  • Type

    conf

  • DOI
    10.1109/SERVICES-1.2008.22
  • Filename
    4578365