Title :
SLIDER: Mining Correlated Motifs in Protein-Protein Interaction Networks
Author :
Boyen, Peter ; Neven, Frank ; Van Dyck, Dries ; Van Dijk, Aalt D J ; Van Ham, Roeland C H J
Abstract :
Correlated motif mining (CMM) is the problem to find overrepresented pairs of patterns, called motif pairs, in interacting protein sequences. Algorithmic solutions for CMM thereby provide a computational method for predicting binding sites for protein interaction. In this paper, we adopt a motif-driven approach where the support of candidate motif pairs is evaluated in the network. We experimentally establish the superiority of the Chi-square-based support measure over other support measures. Furthermore, we obtain that CMM is an NP-hard problem for a large class of support measures (including Chi-square) and reformulate the search for correlated motifs as a combinatorial optimization problem. We then present the method SLIDER which uses local search with a neighborhood function based on sliding motifs and employs the Chi-square-based support measure. We show that SLIDER outperforms existing motif-driven CMM methods and scales to large protein-protein interaction networks.
Keywords :
algorithmic languages; correlation methods; genetics; graph theory; optimisation; Chisquare based support measure; NP hard problem; SLIDER; algorithmic solutions; combinatorial optimization problem; correlated motif mining; interacting protein sequences; large class support measures; local search neighborhood; mining correlated motifs; motif driven approach; motif pairs; predicting binding sites; protein interaction networks; provide computational method; Bioinformatics; Biological information theory; Coordinate measuring machines; Data mining; Fungi; Humans; Large-scale systems; NP-hard problem; Proteins; Scalability; Correlated motifs; PPI networks; local search;
Conference_Titel :
Data Mining, 2009. ICDM '09. Ninth IEEE International Conference on
Conference_Location :
Miami, FL
Print_ISBN :
978-1-4244-5242-2
Electronic_ISBN :
1550-4786
DOI :
10.1109/ICDM.2009.92