• DocumentCode
    125577
  • Title

    Device-Driven Metadata Management Solutions for Scientific Big Data Use Cases

  • Author

    Grunzke, Richard ; Muller-Pfefferkorn, Ralph ; Jakel, R. ; Hesser, Jurgen ; Kepper, Nick ; Hausmann, M. ; Starek, Jurgen ; Gesing, Sandra ; Hardt, Marcus ; Hartmann, Volker ; Potthoff, Jan ; Kindermann, Stephan

  • Author_Institution
    Tech. Univ. Dresden, Dresden, Germany
  • fYear
    2014
  • fDate
    12-14 Feb. 2014
  • Firstpage
    317
  • Lastpage
    321
  • Abstract
    Big Data applications in science are producing huge amounts of data, which require advanced processing, handling, and analysis capabilities. For the organization of large scale data sets it is essential to annotate these with metadata, index them, and make them easily findable. In this paper we investigate two scientific use cases from biology and photon science, which entail complex situations in regard to data volume, data rates and analysis requirements. The LSDMA project provides an ideal context for this research, combining both innovative R&D on the processing, handling, and analysis level and a wide range of research communities in need of scalable solutions. To facilitate the advancement of data life cycles we present preferred metadata management strategies. In biology the Open Microscopy Environment (OME) and in photon science NeXus/ICAT are presented. We show that these are well suited for the respective data life cycles. To facilitate searching across communities we discuss solutions involving the Open Archive Initiative - Protocol for Metadata Harvesting (OAI-PMH) and Apache Lucene/Solr.
  • Keywords
    biology computing; data analysis; meta data; optical microscopy; research and development; Apache Lucene; Apache Solr; LSDMA project; NeXus-ICAT; OAI-PMH; OME; advanced processing capability; analysis capability; biology; data analysis requirements; data life cycles; data rates; data set organization; data volume; device-driven metadata management solutions; handling capability; innovative R&D; light microscopy; open archive initiative-protocol-for-metadata harvesting; open microscopy environment; photon science; scientific big data use cases; Communities; DICOM; Detectors; Microscopy; Photonics; Standards; XML; Light Microscopy; Metadata Management; Photon Science; Scientific Big Data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel, Distributed and Network-Based Processing (PDP), 2014 22nd Euromicro International Conference on
  • Conference_Location
    Torino
  • ISSN
    1066-6192
  • Type

    conf

  • DOI
    10.1109/PDP.2014.119
  • Filename
    6787293