Title :
A fuzzy c-means algorithm using a correlation metrics and gene ontology
Author :
Zhang, Mingrui ; Therneau, Terry ; McKenzie, Michael A. ; Li, Peter ; Yang, Ping
Author_Institution :
Comput. Sci. Dept., Winona State Univ., Winona, MN
Abstract :
A fuzzy c-means algorithm was adapted for analyzing microarray data. The adaptation consisted of initialization of fuzzy centroids using gene ontology information and the use of Pearson correlation distance in the objective function. To initialize fuzzy centroids, we classified genes based on gene ontology terms and used the classified genes as initial fuzzy clusters. Pearson correlation distance becomes 0 if two genes are either positively or negatively correlated. The algorithm was applied to Yeast and lung cancer microarray datasets. It outperformed the conventional fuzzy c-means algorithm by associating more genes to functional groups.
Keywords :
biology computing; cancer; genetics; ontologies (artificial intelligence); pattern classification; pattern clustering; Pearson correlation distance; correlation metrics; fuzzy c-means algorithm; fuzzy centroids; gene classification; gene ontology; lung cancer microarray datasets; microarray data analysis; Biological processes; Cancer; Clustering algorithms; Data analysis; Euclidean distance; Fungi; Fuzzy sets; Gene expression; Lungs; Ontologies;
Conference_Titel :
Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
Conference_Location :
Tampa, FL
Print_ISBN :
978-1-4244-2174-9
Electronic_ISBN :
1051-4651
DOI :
10.1109/ICPR.2008.4761672