• DocumentCode
    2486520
  • Title

    A fuzzy c-means algorithm using a correlation metrics and gene ontology

  • Author

    Zhang, Mingrui ; Therneau, Terry ; McKenzie, Michael A. ; Li, Peter ; Yang, Ping

  • Author_Institution
    Comput. Sci. Dept., Winona State Univ., Winona, MN
  • fYear
    2008
  • fDate
    8-11 Dec. 2008
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    A fuzzy c-means algorithm was adapted for analyzing microarray data. The adaptation consisted of initialization of fuzzy centroids using gene ontology information and the use of Pearson correlation distance in the objective function. To initialize fuzzy centroids, we classified genes based on gene ontology terms and used the classified genes as initial fuzzy clusters. Pearson correlation distance becomes 0 if two genes are either positively or negatively correlated. The algorithm was applied to Yeast and lung cancer microarray datasets. It outperformed the conventional fuzzy c-means algorithm by associating more genes to functional groups.
  • Keywords
    biology computing; cancer; genetics; ontologies (artificial intelligence); pattern classification; pattern clustering; Pearson correlation distance; correlation metrics; fuzzy c-means algorithm; fuzzy centroids; gene classification; gene ontology; lung cancer microarray datasets; microarray data analysis; Biological processes; Cancer; Clustering algorithms; Data analysis; Euclidean distance; Fungi; Fuzzy sets; Gene expression; Lungs; Ontologies;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
  • Conference_Location
    Tampa, FL
  • ISSN
    1051-4651
  • Print_ISBN
    978-1-4244-2174-9
  • Electronic_ISBN
    1051-4651
  • Type

    conf

  • DOI
    10.1109/ICPR.2008.4761672
  • Filename
    4761672