Title :
Biological Data Warehousing System for Identifying Transcriptional Regulatory Sites From Gene Expressions of Microarray Data
Author :
Tsou, Ann-Ping ; Sun, Yi-Ming ; Liu, Chia-Lin ; Huang, Hsien-Da ; Horng, Jorng-Tzong ; Tsai, Meng-Feng ; Liu, Baw-Juine
Author_Institution :
Inst. of Biotechnol. in Med., Nat. Yang-Ming Univ., Taipei
fDate :
7/1/2006 12:00:00 AM
Abstract :
Identification of transcriptional regulatory sites plays an important role in the investigation of gene regulation. For this propose, we designed and implemented a data warehouse to integrate multiple heterogeneous biological data sources with data types such as text-file, XML, image, MySQL database model, and Oracle database model. The utility of the biological data warehouse in predicting transcriptional regulatory sites of coregulated genes was explored using a synexpression group derived from a microarray study. Both of the binding sites of known transcription factors and predicted over-represented (OR) oligonucleotides were demonstrated for the gene group. The potential biological roles of both known nucleotides and one OR nucleotide were demonstrated using bioassays. Therefore, the results from the wet-lab experiments reinforce the power and utility of the data warehouse as an approach to the genome-wide search for important transcription regulatory elements that are the key to many complex biological systems
Keywords :
SQL; XML; biochemistry; biology computing; database management systems; genetics; identification; molecular biophysics; MySQL database model; Oracle database model; XML; binding sites; bioassays; biological data warehousing system; complex biological system; gene expressions; gene regulation; genome-wide search; image; microarray data; multiple heterogeneous biological data source; predicted over-represented oligonucleotides; synexpression group; text-file; transcription factors; transcriptional regulatory site identification; wet-lab experiment; Bioinformatics; Biological system modeling; Data warehouses; Educational institutions; Gene expression; Genomics; Image databases; Sequences; Sun; Warehousing; Databases; data warehouse; gene expression; gene regulation; microarray; regulatory sites; synexpression group; transcription factor;
Journal_Title :
Information Technology in Biomedicine, IEEE Transactions on
DOI :
10.1109/TITB.2005.862466