DocumentCode :
3145711
Title :
Next generation data integration for Life Sciences
Author :
Cohen-Boulakia, Sarah ; Leser, Ulf
Author_Institution :
Univ. Paris-Sud 11, Orsay, France
fYear :
2011
fDate :
11-16 April 2011
Firstpage :
1366
Lastpage :
1369
Abstract :
Ever since the advent of high-throughput biology (e.g., the Human Genome Project), integrating the large number of diverse biological data sets has been considered as one of the most important tasks for advancement in the biological sciences. Whereas the early days of research in this area were dominated by virtual integration systems (such as multi-/federated databases), the current predominantly used architecture uses materialization. Systems are built using ad-hoc techniques and a large amount of scripting. However, recent years have seen a shift in the understanding of what a “data integration system” actually should do, revitalizing research in this direction. In this tutorial, we review the past and current state of data integration for the Life Sciences and discuss recent trends in detail, which all pose challenges for the database community.
Keywords :
biology computing; data handling; ad-hoc techniques; biological data sets; biological sciences; data integration; life sciences; virtual integration systems; Bioinformatics; Biology; Data models; Distributed databases; Ontologies; Semantics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering (ICDE), 2011 IEEE 27th International Conference on
Conference_Location :
Hannover
ISSN :
1063-6382
Print_ISBN :
978-1-4244-8959-6
Electronic_ISBN :
1063-6382
Type :
conf
DOI :
10.1109/ICDE.2011.5767957
Filename :
5767957
Link To Document :
بازگشت