DocumentCode :
2371305
Title :
Regulatory element discovery using tree-structured models
Author :
Phuong, Tu Minh ; Lee, Doheon ; Lee, Kwang Hyung
Author_Institution :
Dept. of BioSystems, Korea Adv. Inst. of Sci. & Technol., Daejeon, South Korea
fYear :
2003
fDate :
19-22 Nov. 2003
Firstpage :
739
Lastpage :
742
Abstract :
Computational discovery of transcriptional regulatory regions in DNA sequences provides an efficient way to broaden our understanding of how cellular processes are controlled. We formulate the regulatory element discovery problem in the regression framework with regulatory regions treated as predictor variables and gene expression levels as responses. We use regression tree models to identify structural relationships between predictors and responses. The regression tree methodology is extended to handle multiple responses from different experiments by modifying the split function. We apply this method to two data sets of the yeast Saccharomyces cerevisiae. The method successfully identifies most of regulatory motifs that are known to control gene transcription under the given experimental conditions. Our method also suggests several putative motifs that present novel regulatory motifs.
Keywords :
DNA; biology computing; data mining; genetics; regression analysis; trees (mathematics); DNA sequences; Saccharomyces cerevisiae yeast; cellular process; gene transcription; predictor variables; putative motifs; regression tree models; regulatory element discovery; structural relationship; Biological system modeling; DNA computing; Fungi; Gene expression; Multivariate regression; Predictive models; Process control; Proteins; Regression tree analysis; Sequences;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
Print_ISBN :
0-7695-1978-4
Type :
conf
DOI :
10.1109/ICDM.2003.1251021
Filename :
1251021
Link To Document :
بازگشت