DocumentCode
1791806
Title
A layer based architecture for provenance in big data
Author
Agrawal, Rajeev ; Imran, Ali ; Seay, Cameron ; Walker, Julian
Author_Institution
Dept. of Comput. Syst. Technol., North Carolina A&T State Univ., Greensboro, NC, USA
fYear
2014
fDate
27-30 Oct. 2014
Firstpage
1
Lastpage
7
Abstract
Big data is a new technology wave that makes the world awash in data. Various organizations accumulate data that are difficult to exploit. Government databases, social media, healthcare databases etc. are the examples of the big data. Big data covers absorbing and analyzing huge amount of data that may have originated or processed outside of the organization. Data provenance can be defined as origin and process of data. It carries significant information of a system. It can be useful for debugging, auditing, measuring performance and trust in data. Data provenance in big data is relatively unexplored topic. It is necessary to appropriately track the creation and collection process of the data to provide context and reproducibility. In this paper, we propose an intuitive layer based architecture of data provenance and visualization. In addition, we show a complete workflow of tracking provenance information of big data.
Keywords
Big Data; data visualisation; software architecture; Big Data; auditing; data analysis; data origin; data processing; data provenance; data trust; data visualization; debugging; government databases; healthcare databases; layer based architecture; performance measurement; social media; system information; Big data; Computer architecture; Data models; Data visualization; Databases; Educational institutions; Security; Big data; Provenance; Query; Visualization;
fLanguage
English
Publisher
ieee
Conference_Titel
Big Data (Big Data), 2014 IEEE International Conference on
Conference_Location
Washington, DC
Type
conf
DOI
10.1109/BigData.2014.7004468
Filename
7004468
Link To Document