Title :
SLIDER: A Generic Metaheuristic for the Discovery of Correlated Motifs in Protein-Protein Interaction Networks
Author :
Boyen, Peter ; Van Dyck, Dries ; Neven, Frank ; Van Ham, Roeland C H J ; Van Dijk, Aalt D J
Author_Institution :
Hasselt Univ., Diepenbeek, Belgium
Abstract :
Correlated motif mining (cmm) is the problem of finding overrepresented pairs of patterns, called motifs, in sequences of interacting proteins. Algorithmic solutions for cmm thereby provide a computational method for predicting binding sites for protein interaction. In this paper, we adopt a motif-driven approach where the support of candidate motif pairs is evaluated in the network. We experimentally establish the superiority of the Chi-square-based support measure over other support measures. Furthermore, we obtain that cmm is an np-hard problem for a large class of support measures (including Chi-square) and reformulate the search for correlated motifs as a combinatorial optimization problem. We then present the generic metaheuristic slider which uses steepest ascent with a neighborhood function based on sliding motifs and employs the Chi-square-based support measure. We show that slider outperforms existing motif-driven cmm methods and scales to large protein-protein interaction networks. The slider-implementation and the data used in the experiments are available on http://bioinformatics.uhasselt.be.
Keywords :
biochemistry; bioinformatics; data mining; molecular biophysics; molecular configurations; optimisation; proteins; statistical analysis; Chi-square-based support measure; SLIDER; binding sites; combinatorial optimization problem; correlated motif mining; generic metaheuristic; neighborhood function; np-hard problem; protein sequences; protein-protein interaction networks; steepest ascent; Amino acids; Bioinformatics; Coordinate measuring machines; Optimization; Polynomials; Proteins; Graphs and networks; biology and genetics.; Algorithms; Amino Acid Motifs; Chi-Square Distribution; Computational Biology; Databases, Protein; Fungal Proteins; Humans; Protein Interaction Mapping; Protein Interaction Maps; Proteins; Sequence Analysis, Protein;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2011.17