DocumentCode :
2459749
Title :
HDW: A High Performance Large Scale Data Warehouse
Author :
You, Jinguo ; Xi, Jianqing ; Zhang, Chuan ; Guo, Gengqi
Author_Institution :
Sch. of Comput. Sci. & Eng., South China Univ. of Technol., Guangzhou
fYear :
2008
fDate :
18-20 Oct. 2008
Firstpage :
200
Lastpage :
202
Abstract :
As data warehouses grow in size, ensuring adequate database performance will be a big challenge. This paper presents a solution, called HDW, based on Google infrastructure such as GFS, Bigtable, MapReduce to build and manage a large scale distributed data warehouse for high performance OLAP analysis. In addition, HDW provides XMLA standard interface for front end applications. The results show that HDW achieves pretty good performance and high scalability, which has been demonstrated on at least 18 nodes with 36 cores.
Keywords :
XML; data mining; data warehouses; distributed databases; Google infrastructure; OLAP analysis; XMLA standard interface; high performance large scale distributed data warehouse; Clustering algorithms; Concurrent computing; Costs; Data visualization; Data warehouses; High performance computing; Large-scale systems; Partitioning algorithms; Performance analysis; Search engines; Bigtable; GFS; MapReduce; data warehouse; high performance;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer and Computational Sciences, 2008. IMSCCS '08. International Multisymposiums on
Conference_Location :
Shanghai
Print_ISBN :
978-0-7695-3430-5
Type :
conf
DOI :
10.1109/IMSCCS.2008.16
Filename :
4760324
Link To Document :
بازگشت