DocumentCode
2486520
Title
A fuzzy c-means algorithm using a correlation metrics and gene ontology
Author
Zhang, Mingrui ; Therneau, Terry ; McKenzie, Michael A. ; Li, Peter ; Yang, Ping
Author_Institution
Comput. Sci. Dept., Winona State Univ., Winona, MN
fYear
2008
fDate
8-11 Dec. 2008
Firstpage
1
Lastpage
4
Abstract
A fuzzy c-means algorithm was adapted for analyzing microarray data. The adaptation consisted of initialization of fuzzy centroids using gene ontology information and the use of Pearson correlation distance in the objective function. To initialize fuzzy centroids, we classified genes based on gene ontology terms and used the classified genes as initial fuzzy clusters. Pearson correlation distance becomes 0 if two genes are either positively or negatively correlated. The algorithm was applied to Yeast and lung cancer microarray datasets. It outperformed the conventional fuzzy c-means algorithm by associating more genes to functional groups.
Keywords
biology computing; cancer; genetics; ontologies (artificial intelligence); pattern classification; pattern clustering; Pearson correlation distance; correlation metrics; fuzzy c-means algorithm; fuzzy centroids; gene classification; gene ontology; lung cancer microarray datasets; microarray data analysis; Biological processes; Cancer; Clustering algorithms; Data analysis; Euclidean distance; Fungi; Fuzzy sets; Gene expression; Lungs; Ontologies;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
Conference_Location
Tampa, FL
ISSN
1051-4651
Print_ISBN
978-1-4244-2174-9
Electronic_ISBN
1051-4651
Type
conf
DOI
10.1109/ICPR.2008.4761672
Filename
4761672
Link To Document