• DocumentCode
    1791836
  • Title

    A layer based architecture for provenance in big data

  • Author

    Imran, Ashiq ; Agrawal, Rajeev ; Walker, Jessie ; Gomes, Anthony

  • Author_Institution
    Department of Computer Science, North Carolina A&T, State University, Greensboro, USA
  • fYear
    2014
  • fDate
    27-30 Oct. 2014
  • Firstpage
    29
  • Lastpage
    31
  • Abstract
    Big data is a new technology wave that makes the world awash in data. Various organizations accumulate data that are difficult to exploit. Government databases, social media, healthcare databases etc. are the examples of that big data. Big data covers absorbing and analyzing huge amount of data that may have originated or processed outside of the organization. Data provenance can be defined as origin and process of data. It carries significant information of a system. It can be useful for debugging, auditing, measuring performance and trust in data. Data provenance in big data is relatively unexplored topic. It is necessary to appropriately track the creation and collection process of the data to provide context and reproducibility. This poster tries to address the challenges of capturing provenance data. Additionally, we propose an intuitive layer based architecture of provenance in big data that can handle the challenges.
  • Keywords
    Access control; Big data; Computer architecture; Data visualization; Databases; Educational institutions; Big data; Provenance; Query; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Big Data (Big Data), 2014 IEEE International Conference on
  • Conference_Location
    Washington, DC, USA
  • Type

    conf

  • DOI
    10.1109/BigData.2014.7004483
  • Filename
    7004483