DocumentCode :
3259454
Title :
Application of Graph-based Data Mining to Metabolic Pathways
Author :
You, Chang Hun ; Holder, Lawrence B. ; Cook, Diane J.
Author_Institution :
Sch. of Electr. Eng. & Comput. Sci., Washington State Univ., Pullman, WA
fYear :
2006
fDate :
Dec. 2006
Firstpage :
169
Lastpage :
173
Abstract :
We present a method for finding biologically meaningful patterns on metabolic pathways using the SUBDUE graph-based relational learning system. A huge amount of biological data that has been generated by long-term research encourages us to move our focus to a systems-level understanding of bio-systems. A biological network, containing various biomolecules and their relationships, is a fundamental way to describe bio-systems. Multirelational data mining finds the relational patterns in both the entity attributes and relations in the data. A graph consisting of vertices and edges between these vertices is a natural data structure to represent biological networks. This paper presents a graph representation of metabolic pathways to contain all features, and describes the application of graph-based relational learning algorithms in both supervised and unsupervised scenarios. Supervised learning finds the unique substructures in a specific type of pathway, which help us understand better how pathways differ. Unsupervised learning shows hierarchical clusters that describe the common substructures in a specific type of pathway, which allow us to better understand the common features in pathways
Keywords :
biology computing; data mining; graph theory; learning (artificial intelligence); molecular biophysics; SUBDUE graph-based relational learning system; biological data; biological network; biologically meaningful patterns; biomolecules; data structure; entity attributes; graph representation; graph-based data mining; hierarchical clusters; metabolic pathways; multirelational data mining; relational patterns; supervised learning; supervised scenario; unsupervised learning; unsupervised scenario; Application software; Bioinformatics; Biological systems; Biology; Computer science; Data mining; Learning systems; Proteins; Supervised learning; Unsupervised learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshops, 2006. ICDM Workshops 2006. Sixth IEEE International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2702-7
Type :
conf
DOI :
10.1109/ICDMW.2006.31
Filename :
4063619
Link To Document :
بازگشت