• DocumentCode
    2914243
  • Title

    Combining LPP with PCA for microarray data clustering

  • Author

    Chen, Chuanliang ; Bie, Rongfang ; Guo, Ping

  • Author_Institution
    Dept. of Comput. Sci., Beijing Normal Univ., Beijing
  • fYear
    2008
  • fDate
    1-6 June 2008
  • Firstpage
    2081
  • Lastpage
    2086
  • Abstract
    DNA microarray technique has produced large amount of gene expression data. To analyze these data, many excellent machine learning techniques have been proposed in recent related work. In this paper, we try to perform the clustering of microarray data by combining the recently proposed locality preserving projection (LPP) method with PCA, i.e. PCA-LPP. The comparison between PCA and PCA-LPP is performed based on two clustering algorithms, K-means and agglomerative hierarchical clustering. As we already known, clustering with the components extracted by PCA instead of the original variables does improve cluster quality. Moreover, our empirical study shows that by using LPP to perform further process the dimensions of components extracted by PCA can be further reduced and the quality of the clusters can be improved greatly meanwhile. Particularly, the first few components obtained by PCA-LPP capture more information of the cluster structure than those of PCA.
  • Keywords
    DNA; biology computing; learning (artificial intelligence); pattern clustering; principal component analysis; DNA microarray technique; agglomerative hierarchical clustering; cluster quality; gene expression data; locality preserving projection method; machine learning techniques; microarray data clustering; Evolutionary computation; Principal component analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Evolutionary Computation, 2008. CEC 2008. (IEEE World Congress on Computational Intelligence). IEEE Congress on
  • Conference_Location
    Hong Kong
  • Print_ISBN
    978-1-4244-1822-0
  • Electronic_ISBN
    978-1-4244-1823-7
  • Type

    conf

  • DOI
    10.1109/CEC.2008.4631074
  • Filename
    4631074