DocumentCode :
2114558
Title :
A network clustering algorithm for detection of protein families
Author :
Jiang Xie ; Minchao Wang ; Dongbo Dai ; Huiran Zhang ; Wu Zhang
Author_Institution :
Sch. of Comput. Eng. & Sci., Shanghai Univ., Shanghai, China
fYear :
2012
fDate :
Aug. 28 2012-Sept. 1 2012
Firstpage :
6329
Lastpage :
6332
Abstract :
Detection of protein families in large scale database is a difficult but important biological problem. Computational clustering methods can effectively address the problem. Although there exist many clustering algorithms, most of them are just based on the threshold. Their computational performances are affected by the weight distribution greatly, and they are only valid for some special networks. A new network clustering algorithm, Markov Finding and Clustering (MFC), is proposed to cluster the proteins into their functionally specific families accurately in this paper. The MFC algorithm makes an improvement in the random walk process and reduces the affection of the noise on the clustering result. It has a good performance on these networks which are not well addressed by existing algorithms sensitive to the noise. Finally, experiments on the protein sequence datasets demonstrate that the algorithm is effective in the detection of protein families and has a better performance than the current algorithms.
Keywords :
Markov processes; biology computing; molecular biophysics; probability; proteins; Markov clustering; Markov finding; biological problem; computational clustering methods; computational performances; large scale database; network clustering algorithm; noise; protein cluster; protein family detection; protein sequence datasets; random walk processing; weight distribution; Accuracy; Bioinformatics; Clustering algorithms; Educational institutions; Legged locomotion; Noise; Proteins; Algorithms; Cluster Analysis; Markov Chains; Proteins;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Engineering in Medicine and Biology Society (EMBC), 2012 Annual International Conference of the IEEE
Conference_Location :
San Diego, CA
ISSN :
1557-170X
Print_ISBN :
978-1-4244-4119-8
Electronic_ISBN :
1557-170X
Type :
conf
DOI :
10.1109/EMBC.2012.6347441
Filename :
6347441
Link To Document :
بازگشت