• DocumentCode
    10062
  • Title

    Nonnegative Least-Squares Methods for the Classification of High-Dimensional Biological Data

  • Author

    Yifeng Li ; Ngom, Alioune

  • Author_Institution
    Sch. of Comput. Sci., Univ. of Windsor, Windsor, ON, Canada
  • Volume
    10
  • Issue
    2
  • fYear
    2013
  • fDate
    March-April 2013
  • Firstpage
    447
  • Lastpage
    456
  • Abstract
    Microarray data can be used to detect diseases and predict responses to therapies through classification models. However, the high dimensionality and low sample size of such data result in many computational problems such as reduced prediction accuracy and slow classification speed. In this paper, we propose a novel family of nonnegative least-squares classifiers for high-dimensional microarray gene expression and comparative genomic hybridization data. Our approaches are based on combining the advantages of using local learning, transductive learning, and ensemble learning, for better prediction performance. To study the performances of our methods, we performed computational experiments on 17 well-known data sets with diverse characteristics. We have also performed statistical comparisons with many classification techniques including the well-performing SVM approach and two related but recent methods proposed in literature. Experimental results show that our approaches are faster and achieve generally a better prediction performance over compared methods.
  • Keywords
    bioinformatics; genetics; genomics; lab-on-a-chip; learning (artificial intelligence); least mean squares methods; statistical analysis; support vector machines; SVM approach; classification model; comparative genomic hybridization data; computational experiment; computational problem; disease detection; ensemble learning; high-dimensional biological data classification; high-dimensional microarray gene expression; local learning; microarray data; nonnegative least-squares classifier; nonnegative least-squares method; reduced prediction accuracy; statistical comparison; support vector machine; therapy response prediction; transductive learning; Algorithms; Classificaiton; Diseases; Least squares methods; Medical information systems; Medicine; algorithms; classifier design and evaluation; Algorithms; Computational Biology; Databases, Genetic; Gene Expression Profiling; Humans; Least-Squares Analysis; Neoplasms; Oligonucleotide Array Sequence Analysis; Support Vector Machines;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2013.30
  • Filename
    6494577