• DocumentCode
    1791806
  • Title

    A layer based architecture for provenance in big data

  • Author

    Agrawal, Rajeev ; Imran, Ali ; Seay, Cameron ; Walker, Julian

  • Author_Institution
    Dept. of Comput. Syst. Technol., North Carolina A&T State Univ., Greensboro, NC, USA
  • fYear
    2014
  • fDate
    27-30 Oct. 2014
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    Big data is a new technology wave that makes the world awash in data. Various organizations accumulate data that are difficult to exploit. Government databases, social media, healthcare databases etc. are the examples of the big data. Big data covers absorbing and analyzing huge amount of data that may have originated or processed outside of the organization. Data provenance can be defined as origin and process of data. It carries significant information of a system. It can be useful for debugging, auditing, measuring performance and trust in data. Data provenance in big data is relatively unexplored topic. It is necessary to appropriately track the creation and collection process of the data to provide context and reproducibility. In this paper, we propose an intuitive layer based architecture of data provenance and visualization. In addition, we show a complete workflow of tracking provenance information of big data.
  • Keywords
    Big Data; data visualisation; software architecture; Big Data; auditing; data analysis; data origin; data processing; data provenance; data trust; data visualization; debugging; government databases; healthcare databases; layer based architecture; performance measurement; social media; system information; Big data; Computer architecture; Data models; Data visualization; Databases; Educational institutions; Security; Big data; Provenance; Query; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Big Data (Big Data), 2014 IEEE International Conference on
  • Conference_Location
    Washington, DC
  • Type

    conf

  • DOI
    10.1109/BigData.2014.7004468
  • Filename
    7004468