Title :
Design of an incremental clustering package for protein function and family analysis
Author :
Chen, Chien-Yu ; Juan, Hsueh-Fen ; Hsiao, Po-Jen ; Chen, Shui-Tein ; Tseng, Hsiang-Wen ; Oyang, Yen-Jen
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ., Taipei, Taiwan
Abstract :
Protein clustering is widely used to facilitate in-depth analysis of protein functions and families. We discuss the design of an incremental protein clustering package that provides comprehensive features for protein function and family analysis. Specifically, the package offers alternative options for carrying out high-quality protein clustering from different aspects. The incremental nature of the clustering algorithm is essential for efficient analysis of contemporary protein databases, whose sizes are growing rapidly. Regarding clustering quality, experiments applying the incremental clustering algorithm to protein sequence analysis show that it identifies protein sequence clusters that match protein families more consistently than the single-link algorithm, the most widely used hierarchical clustering algorithm for protein sequence analysis. We also address the implementation techniques employed to improve system performance.
Keywords :
biology computing; molecular biophysics; pattern clustering; proteins; clustering algorithm; incremental algorithm; incremental clustering package; protein clustering package; protein databases; protein family analysis; protein functions; protein sequence analysis; Algorithm design and analysis; Biochemical analysis; Clustering algorithms; Computer science; Design engineering; Information analysis; Packaging; Protein engineering; Protein sequence; Sequences
Conference_Title :
Multimedia Software Engineering, 2003. Proceedings. Fifth International Symposium on
Conference_Location :
Taichung, Taiwan
Print_ISBN :
0-7695-2031-6
DOI :
10.1109/MMSE.2003.1254454