Title :
GIMS-a data warehouse for storage and analysis of genome sequence and functional data
Author :
Cornell, Mike ; Paton, Norman W. ; Wu, Shengli ; Goble, Carole A. ; Miller, Crispin J. ; Kirby, Paul ; Eilbeck, Karen ; Brass, Andy ; Hayes, Andrew ; Oliver, Stephen G.
Author_Institution :
Dept. of Comput. Sci., Manchester Univ., UK
Abstract :
Effective analysis of genome sequences and associated functional data requires access to many different kinds of biological information. For example, when analysing gene expression data, it may be useful to have access to the sequences upstream of the genes, or to the cellular location of their protein products. Such information is currently stored in different formats at different sites in a way that does not readily allow integrated analyses. The Genome Information Management System (GIMS) is an object database that integrates genome sequence data with functional data on the transcriptome and on protein-protein interactions in a single data warehouse. We have used GIMS to store the Saccharomyces cerevisiae (yeast) genome and to demonstrate how the integrated storage of diverse kinds of genomic data can be beneficial for analysing data using context-rich queries and analyses. GIMS allows data to be stored in a way that reflects the underlying mechanisms in the organism, and permits complex questions to be asked of the data. This paper provides an overview of the GIMS system and describes some analyses that illustrate its use for analysing functional data sets for S. cerevisiae
Keywords :
biology computing; cellular biophysics; data analysis; data warehouses; genetics; molecular biophysics; object-oriented databases; proteins; scientific information systems; sequences; GIMS; Genome Information Management System; Saccharomyces cerevisiae genome; biological information; cellular location; context-rich analyses; context-rich queries; data warehouse; functional data analysis; functional data storage; gene expression data; genome sequence analysis; genome sequence storage; object database; organism; protein products; protein-protein interactions; transcriptome; yeast; Bioinformatics; Data analysis; Data warehouses; Databases; Fungi; Gene expression; Genomics; Information analysis; Information management; Proteins;
Conference_Titel :
Bioinformatics and Bioengineering Conference, 2001. Proceedings of the IEEE 2nd International Symposium on
Conference_Location :
Bethesda, MD
Print_ISBN :
0-7695-1423-5
DOI :
10.1109/BIBE.2001.974407