DocumentCode :
2196975
Title :
SQMD: Architecture for Scalable, Distributed Database System Built on Virtual Private Servers
Author :
Kim, Kangseok ; Pierce, Marlon E. ; Guha, Rajarshi
Author_Institution :
Community Grids Lab., Indiana Univ., Bloomington, IN
fYear :
2008
fDate :
7-12 Dec. 2008
Firstpage :
658
Lastpage :
665
Abstract :
Many scientific fields routinely generate huge datasets. In many cases, these datasets are not static but rapidly grow in size. Handling these types of datasets, as well as allowing sophisticated queries necessitates scalable distributed database systems, in which scientists are efficiently able to search the datasets. In this paper we present the architecture, implementation and performance analysis of a scalable, distributed database system built on software based virtualization environments. The system architecture makes use of a software partitioning of the database based on data clustering, SQMD (single query multiple database) mechanism, a Web service interface, and virtualization software technologies. The system allows uniform access to concurrently distributed databases, using the SQMD mechanism based on the publish/subscribe paradigm. We highlight the scalability of our architecture by applying it to a database of 17 million chemical structures. In addition to simple identifier based retrieval, we will present performance results for shape similarity queries, which is extremely, time intensive with traditional architectures.
Keywords :
data visualisation; distributed databases; pattern clustering; query processing; software architecture; user interfaces; virtual private networks; SQMD architecture; Web service interface; data clustering; identifier based retrieval; publish-subscribe paradigm; scalable distributed database system; single query multiple database mechanism; software partitioning; virtual private servers; virtualization software technologies; Chemical technology; Computer architecture; Database systems; Distributed databases; Performance analysis; Scalability; Service oriented architecture; Software performance; Software systems; Web services; data clustering; distributed database system; virtualization; web service;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
eScience, 2008. eScience '08. IEEE Fourth International Conference on
Conference_Location :
Indianapolis, IN
Print_ISBN :
978-1-4244-3380-3
Electronic_ISBN :
978-0-7695-3535-7
Type :
conf
DOI :
10.1109/eScience.2008.35
Filename :
4736881
Link To Document :
بازگشت