DocumentCode
2114558
Title
A network clustering algorithm for detection of protein families
Author
Jiang Xie ; Minchao Wang ; Dongbo Dai ; Huiran Zhang ; Wu Zhang
Author_Institution
Sch. of Comput. Eng. & Sci., Shanghai Univ., Shanghai, China
fYear
2012
fDate
Aug. 28 2012-Sept. 1 2012
Firstpage
6329
Lastpage
6332
Abstract
Detection of protein families in large scale database is a difficult but important biological problem. Computational clustering methods can effectively address the problem. Although there exist many clustering algorithms, most of them are just based on the threshold. Their computational performances are affected by the weight distribution greatly, and they are only valid for some special networks. A new network clustering algorithm, Markov Finding and Clustering (MFC), is proposed to cluster the proteins into their functionally specific families accurately in this paper. The MFC algorithm makes an improvement in the random walk process and reduces the affection of the noise on the clustering result. It has a good performance on these networks which are not well addressed by existing algorithms sensitive to the noise. Finally, experiments on the protein sequence datasets demonstrate that the algorithm is effective in the detection of protein families and has a better performance than the current algorithms.
Keywords
Markov processes; biology computing; molecular biophysics; probability; proteins; Markov clustering; Markov finding; biological problem; computational clustering methods; computational performances; large scale database; network clustering algorithm; noise; protein cluster; protein family detection; protein sequence datasets; random walk processing; weight distribution; Accuracy; Bioinformatics; Clustering algorithms; Educational institutions; Legged locomotion; Noise; Proteins; Algorithms; Cluster Analysis; Markov Chains; Proteins;
fLanguage
English
Publisher
ieee
Conference_Titel
Engineering in Medicine and Biology Society (EMBC), 2012 Annual International Conference of the IEEE
Conference_Location
San Diego, CA
ISSN
1557-170X
Print_ISBN
978-1-4244-4119-8
Electronic_ISBN
1557-170X
Type
conf
DOI
10.1109/EMBC.2012.6347441
Filename
6347441
Link To Document