Title :
GO-based Gene Expression Cluster Validation
Author_Institution :
Dept. of Radiol. &
Abstract :
Fuzzy C-Means (FCM) algorithm has been widely used in cluster analysis of gene expression data. It can converge rapidly and provide more information regarding relationships within the data thanks to the usage of fuzzy sets to represent the degrees of cluster membership for every data point. However, FCM has a shortcoming in that it requires a priori specification of cluster number and cluster validation. Most cluster validity indices are based on the data themselves and may not be applicable for gene expression data. In this paper, we propose a Bayesian method using Gene Ontology (GO) annotations for gene expression cluster validation. We show that our method outperforms popular validity indices on gene expression datasets.
Keywords :
"Semantics","Indexes","Gene expression","Biological system modeling","Data models","Ontologies"
Conference_Titel :
Computational Science and Computational Intelligence (CSCI), 2015 International Conference on
DOI :
10.1109/CSCI.2015.34