Title :
Mining Conditional Phosphorylation Motifs
Author :
Xiaoqing Liu ; Jun Wu ; Haipeng Gong ; Shengchun Deng ; Zengyou He
Author_Institution :
Sch. of Software, Dalian Univ. of Technol., Dalian, China
fDate :
Sept.-Oct. 1 2014
Abstract :
Phosphorylation motifs represent position-specific amino acid patterns around the phosphorylation sites in the set of phosphopeptides. Several algorithms have been proposed to uncover phosphorylation motifs, whereas the problem of efficiently discovering a set of significant motifs with sufficiently high coverage and non-redundancy still remains unsolved. Here we present a novel notion called conditional phosphorylation motifs. Through this new concept, the motifs whose over-expressiveness mainly benefits from its constituting parts can be filtered out effectively. To discover conditional phosphorylation motifs, we propose an algorithm called C-Motif for a non-redundant identification of significant phosphorylation motifs. C-Motif is implemented under the Apriori framework, and it tests the statistical significance together with the frequency of candidate motifs in a single stage. Experiments demonstrate that C-Motif outperforms some current algorithms such as MMFPh and Motif-All in terms of coverage and non-redundancy of the results and efficiency of the execution. The source code of C-Motif is available at: https://sourceforge. net/projects/cmotif/.
Keywords :
biochemistry; biology computing; data mining; molecular biophysics; proteins; statistical analysis; Apriori framework; C-Motif; MMFPh; Motif-All; conditional phosphorylation motifs mining; nonredundant identification; phosphopeptides; phosphorylation sites; position-specific amino acid patterns; statistical significance; Amino acids; Bioinformatics; Computational biology; Data mining; Peptides; Proteins; Phosphorylation motif; data mining; frequent pattern; protein phosphorylation;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2014.2321400