Title :
A novel OPTOC-based clustering algorithm for gene expression data analysis
Author :
Liew, Alan WeeChung ; Yan, Hong ; Wu, Shuanhu
Author_Institution :
Dept. of Comput. Eng. & Inf. Eng., City Univ. of Hong Kong, Kowloon, China
Abstract :
Cluster analysis of gene expression data is useful for identifying biologically relevant groups of genes. However, finding the correct clusters in the data and estimating the correct number of clusters are still two largely unsolved problems. In this paper, we propose a new clustering framework that is able to address both these problems. By using the one-prototype-take-one-cluster (OPTOC) competitive learning paradigm, the proposed algorithm can find natural clusters in the input data, and the clustering solution is not sensitive to initialization. In order to estimate the number of distinct clusters in the data, an over-clustering and merging strategy is proposed. For validation, we applied the new algorithm to both simulated gene expression data and real gene expression data (expression changes during yeast cell cycle). The results clearly indicate the effectiveness of our method.
Keywords :
biology computing; competitive algorithms; data analysis; genetics; pattern clustering; cluster analysis; gene expression data analysis; one-prototype-take-one-cluster competitive learning paradigm; Biological system modeling; Clustering algorithms; DNA; Data analysis; Data engineering; Fungi; Gene expression; Information analysis; Merging; Prototypes;
Conference_Titel :
Information, Communications and Signal Processing, 2003 and Fourth Pacific Rim Conference on Multimedia. Proceedings of the 2003 Joint Conference of the Fourth International Conference on
Print_ISBN :
0-7803-8185-8
DOI :
10.1109/ICICS.2003.1292701