DocumentCode :
3104897
Title :
Biclustering Protein Complex Interactions with a Biclique Finding Algorithm
Author :
Ding, Chris ; Zhang, Ya ; Li, Tao ; Holbrook, Stephen R.
Author_Institution :
Lawrence Berkeley Nat´´l Lab., Berkeley, CA
fYear :
2006
fDate :
18-22 Dec. 2006
Firstpage :
178
Lastpage :
187
Abstract :
Biclustering has many applications in text mining, Web clickstream mining, and bioinformatics. When data entries are binary, the tightest biclusters become bicliques. We propose a flexible and highly efficient algorithm to compute bicliques. We first generalize the Motzkin-Straus formalism for computing the maximal clique from L1 constraint to Lp constraint, which enables us to provide a generalized Motzkin-Straus formalism for computing maximal-edge bicliques. By adjusting parameters, the algorithm can favor biclusters with more rows less columns, or vice verse, thus increasing the flexibility of the targeted biclusters. We then propose an algorithm to solve the generalized Motzkin-Straus optimization problem. The algorithm is provably convergent and has a computational complexity of O(/E/) where /E/ is the number of edges. Using this algorithm, we bicluster the yeast protein complex interaction network. We find that biclustering protein complexes at the protein level does not clearly reflect the functional linkage among protein complexes in many cases, while biclustering at protein domain level can reveal many underlying linkages. We show several new biologically significant results.
Keywords :
biology computing; computational complexity; data mining; pattern clustering; Web clickstream mining; biclique finding; biclustering protein complex interactions; bioinformaticMotzkin-Straus formalisms; computational complexity; generalized Motzkin-Straus optimization problem; text mining; yeast protein complex interaction network; Bioinformatics; Bipartite graph; Computational complexity; Couplings; Data mining; Fungi; Itemsets; Phylogeny; Proteins; Text mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2006. ICDM '06. Sixth International Conference on
Conference_Location :
Hong Kong
ISSN :
1550-4786
Print_ISBN :
0-7695-2701-7
Type :
conf
DOI :
10.1109/ICDM.2006.27
Filename :
4053046
Link To Document :
بازگشت