• DocumentCode
    623995
  • Title

    Addressing big data issues in Scientific Data Infrastructure

  • Author

    Demchenko, Y. ; Grosso, Paola ; de Laat, Cees ; Membrey, Peter

  • Author_Institution
    Syst. & Network Eng. Group, Univ. of Amsterdam, Amsterdam, Netherlands
  • fYear
    2013
  • fDate
    20-24 May 2013
  • Firstpage
    48
  • Lastpage
    55
  • Abstract
    Big Data are becoming a new technology focus both in science and in industry. This paper discusses the challenges that are imposed by Big Data on the modern and future Scientific Data Infrastructure (SDI). The paper discusses a nature and definition of Big Data that include such features as Volume, Velocity, Variety, Value and Veracity. The paper refers to different scientific communities to define requirements on data management, access control and security. The paper introduces the Scientific Data Lifecycle Management (SDLM) model that includes all the major stages and reflects specifics in data management in modern e-Science. The paper proposes the SDI generic architecture model that provides a basis for building interoperable data or project centric SDI using modern technologies and best practices. The paper explains how the proposed models SDLM and SDI can be naturally implemented using modern cloud based infrastructure services provisioning model and suggests the major infrastructure components for Big Data.
  • Keywords
    Web services; authorisation; cloud computing; open systems; scientific information systems; SDI; SDI generic architecture model; SDLM model; access control; big data issues; cloud based infrastructure service provisioning model; data security; e-Science; interoperable data; project centric SDI; scientific communities; scientific data infrastructure; scientific data lifecycle management model; Communities; Data handling; Data models; Data storage systems; Distributed databases; Industries; Information management; Big Data Infrastructure; Big Data Science; Cloud Infrastructure Service; Scientific Data Infrastructure (SDI); Scientific Data Lifecycle Management (SDLM);
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Collaboration Technologies and Systems (CTS), 2013 International Conference on
  • Conference_Location
    San Diego, CA
  • Print_ISBN
    978-1-4673-6403-4
  • Type

    conf

  • DOI
    10.1109/CTS.2013.6567203
  • Filename
    6567203