Title :
HDW: A High Performance Large Scale Data Warehouse
Author :
You, Jinguo ; Xi, Jianqing ; Zhang, Chuan ; Guo, Gengqi
Author_Institution :
Sch. of Comput. Sci. & Eng., South China Univ. of Technol., Guangzhou
Abstract :
As data warehouses grow in size, ensuring adequate database performance will be a big challenge. This paper presents a solution, called HDW, based on Google infrastructure such as GFS, Bigtable, MapReduce to build and manage a large scale distributed data warehouse for high performance OLAP analysis. In addition, HDW provides XMLA standard interface for front end applications. The results show that HDW achieves pretty good performance and high scalability, which has been demonstrated on at least 18 nodes with 36 cores.
Keywords :
XML; data mining; data warehouses; distributed databases; Google infrastructure; OLAP analysis; XMLA standard interface; high performance large scale distributed data warehouse; Clustering algorithms; Concurrent computing; Costs; Data visualization; Data warehouses; High performance computing; Large-scale systems; Partitioning algorithms; Performance analysis; Search engines; Bigtable; GFS; MapReduce; data warehouse; high performance;
Conference_Titel :
Computer and Computational Sciences, 2008. IMSCCS '08. International Multisymposiums on
Conference_Location :
Shanghai
Print_ISBN :
978-0-7695-3430-5
DOI :
10.1109/IMSCCS.2008.16