Title :
Exploring Data Warehouse Appliances for Mesh Analysis Applications
Author :
Ulmer, Craig ; Bayer, Greg ; Choe, Yung Ryn ; Roe, Diana
Author_Institution :
Sandia Nat. Labs., Livermore, CA, USA
Abstract :
As scientific computing users migrate to petaflop platforms that promise to generate multi-terabyte datasets, there is a growing need in the community to be able to embed sophisticated data analysis algorithms in the storage systems for the computing platforms. Data Warehouse Appliances (DWAs) are an attractive option for this work, due to their ability to process massive datasets efficiently. While DWAs have been proven effective in data mining and informatics applications, there are relatively few examples of how DWAs can be integrated into the scientific computing workflow. In this paper we present our experiences in adapting two mesh analysis algorithms to function on two different DWAs: a SQL-based Netezza database appliance and a Map/Reduce-based Hadoop cluster. The main contribution of this work is insight into the differences between the two platforms´ programming environments. In addition, we present performance measurements for entry-level DWAs to help provide a first-order comparison of the hardware.
Keywords :
SQL; data mining; data warehouses; mesh generation; scientific information systems; SQL-based Netezza database appliance; data mining; data warehouse appliances; informatics applications; map/reduce-based Hadoop cluster; mesh analysis; multiterabyte datasets; petaflop platforms; scientific computing; sophisticated data analysis algorithms; storage systems; Algorithm design and analysis; Clustering algorithms; Data analysis; Data mining; Data warehouses; Databases; Embedded computing; Home appliances; Informatics; Scientific computing;
Conference_Titel :
System Sciences (HICSS), 2010 43rd Hawaii International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
978-1-4244-5509-6
Electronic_ISBN :
1530-1605
DOI :
10.1109/HICSS.2010.200