Title :
Identifying Interface Elements Implied in Protein-Protein Interactions Using Statistical Tests and Frequent Item Sets
Author :
Martin, Christine ; Cornuejols, A.
Author_Institution :
LIMSI, Univ. d´´Orsay Paris Sud, Orsay
Abstract :
Understanding what are the characteristics of protein-protein interfaces is at the core of numerous applications.This paper introduces a method in which the proteins are described with surfacic geometrical elements. Starting from a database of known interfaces, the method produces the elements and combinations thereof that are characteristic of the interfaces. This is done thanks to a frequent item set technique and the use of statistical tests to ensure a marked difference with a null hypothesis. This approach allows one to easily interpret the results, as compared to techniques that operate as ldquoblack-boxesrdquo. Furthermore, it is naturally adapted to discover disjunctive concepts, i.e. different underlying processes. The results obtained on a set of 459 protein-protein interfaces from the PDB database confirm that the findings are consistent with current knowledge about protein-protein interfaces.
Keywords :
biology computing; proteins; statistical analysis; PDB database; frequent item sets; interface elements; protein-protein interaction; statistical tests; surfacic geometrical elements; Bioinformatics; Data mining; Dictionaries; Learning systems; Machine learning; Predictive models; Proteins; Spatial databases; Testing; Transaction databases; data mining; frequent item sets; protein-protein interactions;
Conference_Titel :
Bioinformatics and Biomedicine, 2008. BIBM '08. IEEE International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
978-0-7695-3452-7
DOI :
10.1109/BIBM.2008.68