Title :
From sipping on a straw to drinking from a fire hose: data integration in a public genome database
Author :
Richardson, J.E. ; Kadin, J.A. ; Blake, J.A. ; Bult, C.J. ; Eppig, J.T. ; Ringwald, M.
Author_Institution :
Mouse Genome Informatics Group, The Jackson Lab., Bar Harbor, ME, USA
Date :
30 March-2 April 2004
Abstract :
Biology is a vast domain. The Mouse Genome Informatics (MGI) system, which focuses on the biology of the laboratory mouse, covers only a small, carefully chosen slice. Nevertheless, we deal with data of immense variety, deep complexity, and exponentially growing volume. Our role as an integration nexus is to add value by combining data sets of diverse types and origins, eliminating redundancy and resolving conflicts. We briefly describe some of the issues we face and approaches we have adopted to the integration problem.
Keywords :
biology computing; genetics; information resources; very large databases; data integration; data sets; laboratory mouse; mouse genome informatics system; public genome database; Bioinformatics; Capacitive sensors; Databases; Fires; Genomics; Hoses; Humans; Laboratories; Mice; Sequences
Conference_Title :
Data Engineering, 2004. Proceedings. 20th International Conference on
Print_ISBN :
0-7695-2065-0
DOI :
10.1109/ICDE.2004.1320050