Title :
Predicting sugar regulation in Arabidopsis thaliana using kernel learning methods
Author :
Saadi, Kamel ; Lee, Kee-Khoon ; Cawley, Gavin C. ; Bevan, Michael W.
Author_Institution :
Sch. of Comput. Sci., East Anglia Univ., Norwich, UK
fDate :
31 July-4 Aug. 2005
Abstract :
The ability to predict the transcriptional regulation of genes, based on the composition of the upstream promoter region, would be a useful step in deciphering gene regulatory networks in eukaryotic organisms. In this paper we perform optimally regularised kernel Fisher discriminant (ORKFD) analysis of the upstream promoter sequences of genes to predict whether they are up- or down-regulated in response to glucose in the model plant Arahuiopsis thaliana. Three feature selection strategies are investigated, namely use of known promoter motifs drawn from the PLACE database, explicit enumeration of all possible k-mers and the use of the mismatch kernels (which effectively permits the construction of a linear model in the space of all possible k-mers with up to in mismatches). The leave-one-out cross-validation (LOOCV) error rate indicates that approximately two-thirds of the observed regulatory behaviour can be inferred by the presence of particular motifs in the upstream promoter sequence. The analysis has yielded novel biological insight, which has since been confirmed experimentally in vivo.
Keywords :
biology computing; genetics; learning (artificial intelligence); macromolecules; sugar; Arahuiopsis thaliana; PLACE database; arabidopsis thaliana; eukaryotic organisms; feature selection; gene regulatory networks; glucose; k-mers; kernel learning methods; leave-one-out cross-validation error rate; mismatch kernels; promoter motifs; regularised kernel Fisher discriminant analysis; sugar regulation; transcriptional gene regulation; upstream promoter sequences; Biological system modeling; Error analysis; In vivo; Kernel; Learning systems; Organisms; Performance analysis; Predictive models; Spatial databases; Sugar;
Conference_Titel :
Neural Networks, 2005. IJCNN '05. Proceedings. 2005 IEEE International Joint Conference on
Print_ISBN :
0-7803-9048-2
DOI :
10.1109/IJCNN.2005.1555824