DocumentCode
560216
Title
Qserv: A distributed shared-nothing database for the LSST catalog
Author
Wang, Daniel L. ; Monkewitz, Serge M. ; Lim, Kian-Tat ; Becla, Jacek
Author_Institution
SLAC Nat. Accel. Lab., Menlo Park, CA, USA
fYear
2011
fDate
12-18 Nov. 2011
Firstpage
1
Lastpage
11
Abstract
The LSST project will provide public access to a database catalog that, in its final year, is estimated to include 26 billion stars and galaxies in dozens of trillion detections in multiple petabytes. Because we are not aware of an existing open-source database implementation that has been demonstrated to efficiently satisfy astronomers´ spatial self-joining and cross-matching queries at this scale, we have implemented Qserv, a distributed shared-nothing SQL database query system. To speed development, Qserv relies on two successful open-source software packages: the MySQL RDBMS and the Xrootd distributed file system. We describe Qserv´s design, architecture, and ability to scale to LSST´s data requirements. We illustrate its potential with test results on a 150-node cluster using 55 billion rows and 30 terabytes of simulated data. These results demonstrate the soundness of Qserv´s approach and the scale it achieves on today´s hardware.
Keywords
SQL; astronomy computing; public domain software; query processing; relational databases; scientific information systems; LSST catalog; MySQL RDBMS; Qserv; Xrootd distributed file system; astronomers; database catalog; distributed shared nothing SQL database query system; galaxies; open source database implementation; public access; stars; Bandwidth; Catalogs; Distributed databases; Hardware; Indexing; Servers; MPP; database; distributed; file system; parallel; shared-nothing;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computing, Networking, Storage and Analysis (SC), 2011 International Conference for
Conference_Location
Seatle, WA
Electronic_ISBN
978-1-4503-0771-0
Type
conf
Filename
6114487
Link To Document