Title :
Dynamic distributed dimensional data model (D4M) database and computation system
Author :
Kepner, Jeremy ; Arcand, William ; Bergeron, William ; Bliss, Nadya ; Bond, Robert ; Byun, Chansup ; Condon, Gary ; Gregson, Kenneth ; Hubbell, Matthew ; Kurz, Jonathan ; McCabe, Andrew ; Michaleas, Peter ; Prout, Andrew ; Reuther, Albert ; Rosa, Antonio
Author_Institution :
MIT Lincoln Lab., Lexington, MA, USA
Abstract :
A crucial element of large web companies is their ability to collect and analyze massive amounts of data. Tuple store databases are a key enabling technology employed by many of these companies (e.g., Google Big Table and Amazon Dynamo). Tuple stores are highly scalable and run on commodity clusters, but lack interfaces to support efficient development of mathematically based analytics. D4M (Dynamic Distributed Dimensional Data Model) has been developed to provide a mathematically rich interface to tuple stores (and structured query language “SQL” databases). D4M allows linear algebra to be readily applied to databases. Using D4M, it is possible to create composable analytics with significantly less effort than using traditional approaches. This work describes the D4M technology and its application and performance.
Keywords :
SQL; data handling; database management systems; distributed processing; linear algebra; Amazon Dynamo; D4M database; Google Big Table; SQL databases; Web companies; commodity clusters; computation system; dynamic distributed dimensional data model database; linear algebra; structured query language; tuple stores; Arrays; Distributed databases; Linear algebra; MATLAB; Standards; associative array; database; fuzzy algebra; linear algebra; tuple store;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6289129