Title :
GADBMS: A Framework for Scalable Array Analytics
Author :
Clemons, Tyler ; Parthasarathy, Srinivasan ; Sadayappan, P.
Author_Institution :
Dept. of CSE, Ohio State Univ., Columbus, OH, USA
Abstract :
With the help of advancing technology, the scientific community and data mining community are producing an increasing amount of complex data. This data can be stored in multidimensional arrays and has been known to scale in the petabyte range. An obvious solution is to distribute the data across many nodes and work in parallel. However, optimizing storage for space limitations and access, as well as optimizing in memory execution is not intuitive. Array Database Management Systems (ADBMS) can be used to store these large datasets. This position paper will present an ADBMS supported by the Global Arrays framework that will allow users in both the scientific and data mining communities to efficiently store, access, and operate over large datasets in an easy to use framework we call GADBMS (Global-arrays Array Database Management System).
Keywords :
data analysis; data mining; database management systems; ADBMS; GADBMS framework; array database management system; data distribution; data mining community; data storage; global arrays framework; multidimensional array; scalable array analytics; scientific community; Data Mining; Database Systems; High Performance Computing; Scientific Computing;
Conference_Titel :
High Performance Computing, Networking, Storage and Analysis (SCC), 2012 SC Companion:
Conference_Location :
Salt Lake City, UT
Print_ISBN :
978-1-4673-6218-4
DOI :
10.1109/SC.Companion.2012.165