DocumentCode :
3052745
Title :
Enabling Active Data Archival over Cloud
Author :
Gupta, Rajeev ; Gupta, Himanshu ; Nambiar, Ullas ; Mohania, Mukesh
Author_Institution :
IBM Res., New Delhi, India
fYear :
2012
fDate :
24-29 June 2012
Firstpage :
98
Lastpage :
105
Abstract :
The need to analyze huge amount of data for various business intelligence applications is well known. However, the rate at which enterprise data is generated now demands periodic migration of older data from the operational data warehouse to magnetic tapes. In this paper, we propose an "Active data archival service" in which the data is seamlessly archived on the cloud while ensuring that the archived data can be queried without any perceptible change to the end-user. This takes the burden of maintaining the archive off the user and shifts it to the archival service. We discuss the architecture of the service, challenges arising therein due to the federation of data brought on by the archival and how we handle these issues. Specifically, we investigate how the relational data needs to be transformed so that storing and retrieving the data from the cloud is efficient and seamless to the end user. We present our insights through an experimental study using TPC-DS benchmark.
Keywords :
cloud computing; competitive intelligence; data warehouses; information retrieval; relational databases; storage management; TPC-DS benchmark; active data archival service; business intelligence applications; cloud computing; operational data warehouse; periodic migration; relational data; Benchmark testing; Business; Data models; Data warehouses; Indexing; Memory; Vectors; MapReduce; Massive scale data management; cloud; hadoop; querying;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Services Computing (SCC), 2012 IEEE Ninth International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
978-1-4673-3049-7
Type :
conf
DOI :
10.1109/SCC.2012.20
Filename :
6274132
Link To Document :
بازگشت