DocumentCode :
3316382
Title :
Graph Alignment: Fuzzy Pattern Mining for the Structural Analysis of Protein Active Sites
Author :
Hüllermeier, Eyke ; Weskamp, Nils ; Klebe, Gerhard ; Kuhn, Daniel
Author_Institution :
Marburg Univ., Marburg
fYear :
2007
fDate :
23-26 July 2007
Firstpage :
1
Lastpage :
6
Abstract :
Graphs are frequently used to describe the geometry and the physicochemical composition of protein active sites. Here, the concept of graph alignment is presented as a novel method for the structural analysis of protein binding pockets. Using inexact, approximate graph-matching techniques, our method enables the robust identification of fuzzily conserved areas in binding pockets. Thus, using multiple graph alignments, it is possible to characterize functional protein families independently of sequence or fold homology. This paper first introduces the problem of graph alignment formally and discusses algorithmic solutions to it. Then, it is shown how the calculated graph alignments can be analyzed to identify structural features that are characteristic of a given protein family. In this connection, the related concept of a fuzzy consensus graph is introduced. The methods are applied to a substantial high-quality subset of the Protein Data Bank (PDB), and their ability to characterize and classify 10 highly populated functional protein families is shown.
Keywords :
biochemistry; biology computing; data mining; fuzzy set theory; graph theory; pattern classification; proteins; PDB database; approximate graph-matching technique; functional protein family classification; fuzzy consensus graph; fuzzy pattern mining; graph alignment; physicochemical composition; protein active sites structural analysis; Database systems; Drugs; Geometry; Pattern analysis; Pharmaceuticals; Proteins; Robustness; Sequences; Shape; Spatial databases;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Fuzzy Systems Conference, 2007. FUZZ-IEEE 2007. IEEE International
Conference_Location :
London
ISSN :
1098-7584
Print_ISBN :
1-4244-1209-9
Electronic_ISBN :
1098-7584
Type :
conf
DOI :
10.1109/FUZZY.2007.4295409
Filename :
4295409