• DocumentCode
    8933
  • Title

    Local and Global Preserving Semisupervised Dimensionality Reduction Based on Random Subspace for Cancer Classification

  • Author

    Xianfa Cai ; Jia Wei ; Guihua Wen ; Zhiwen Yu

  • Author_Institution
    Sch. of Med. Inf. Eng., Guangdong Pharm. Univ., Guangzhou, China
  • Volume
    18
  • Issue
    2
  • fYear
    2014
  • fDate
    Mar-14
  • Firstpage
    500
  • Lastpage
    507
  • Abstract
    Precise cancer classification is essential to the successful diagnosis and treatment of cancers. Although semisupervised dimensionality reduction approaches perform very well on clean datasets, the topology of the neighborhood constructed with most existing approaches is unstable in the presence of high-dimensional data with noise. In order to solve this problem, a novel local and global preserving semisupervised dimensionality reduction based on random subspace algorithm marked as RSLGSSDR, which utilizes random subspace for semisupervised dimensionality reduction, is proposed. The algorithm first designs multiple diverse graphs on different random subspace of datasets and then fuses these graphs into a mixture graph on which dimensionality reduction is performed. As the mixture graph is constructed in lower dimensionality, it can ease the issues on graph construction on high-dimensional samples such that it can hold complicated geometric distribution of datasets as the diversity of random subspaces. Experimental results on public gene expression datasets demonstrate that the proposed RSLGSSDR not only has superior recognition performance to competitive methods, but also is robust against a wide range of values of input parameters.
  • Keywords
    cancer; graph theory; medical diagnostic computing; patient diagnosis; patient treatment; RSLGSSDR; cancer classification; cancer diagnosis; cancer treatment; global preserving semisupervised dimensionality reduction; local preserving semisupervised dimensionality reduction; multiple diverse graphs; neighborhood topology; noise; public gene expression datasets; random subspace algorithm; superior recognition performance; Accuracy; Cancer; Gene expression; Noise; Principal component analysis; Robustness; Tumors; Cancer classification; dimensionality reduction; graph construction; random subspace; semisupervised learning;
  • fLanguage
    English
  • Journal_Title
    Biomedical and Health Informatics, IEEE Journal of
  • Publisher
    ieee
  • ISSN
    2168-2194
  • Type

    jour

  • DOI
    10.1109/JBHI.2013.2281985
  • Filename
    6600760