Title :
Cross Engine Database Joining
Author :
Leonard, Wesley ; Albee, Paul
Author_Institution :
Coll. of Health Professions, Central Michigan Univ., Mount Pleasant, MI, USA
Abstract :
A standards-based, open-source middleware system was designed and implemented to facilitate the analysis of large and disparate datasets. This system makes it possible to access several different types of database servers simultaneously, browse remote data, combine datasets, and join tables from remote databases independent of vendor. The system uses an algorithm known as Dynamic Merge Cache to handle data caching, query generation, transformations, and joining with minimal operational interference to source databases. The system is able to combine any subset of configured databases and convert the information into XML. The resulting XML is made available to analysis tools through a web service. After the system connects to a remote database, a metadata catalog is created from the source database. The user is able to configure which tables and fields to export from the remote dataset. The user is also able to filter, transform, and combine data. The system was tested with a large fish contaminant database and a second database populated with simulated scientific data.
Keywords :
Databases; Engines; Filters; Heuristic algorithms; Interference; Middleware; Open source software; System testing; Web services; XML; Database connectivity; Distributed databases; Middleware/business logic;
Conference_Titel :
Software Engineering Research, Management and Applications (SERA), 2010 Eighth ACIS International Conference on
Conference_Location :
Montreal, QC, Canada
Print_ISBN :
978-0-7695-4075-7
Electronic_ISBN :
978-1-4244-7337-3
DOI :
10.1109/SERA.2010.13