• DocumentCode
    1922356
  • Title

    Bagged ensembles of Support Vector Machines for gene expression data analysis

  • Author

    Valentini, Giorgio ; Muselli, Marco ; Ruffino, Francesca

  • Author_Institution
    Dipt. di Sci. dell´´ Inf., Univ. degli Studi di Milano, Milan, Italy
  • Volume
    3
  • fYear
    2003
  • fDate
    20-24 July 2003
  • Firstpage
    1844
  • Abstract
    Extracting information from gene expression data is a difficult task, as these data are characterized by very high dimensional, small sized, samples and large degree of biological variability. However, a possible way of dealing with the curse of dimensionality is offered by feature selection algorithms, while variance problems arising from small samples and biological variability can be addressed through ensemble methods based on resampling techniques. These two approaches have been combined to improve the accuracy of Support Vector Machines (SVM) in the classification of malignant tissues from DNA microarray data. To assess the accuracy and the confidence of the predictions performed proper measures have been introduced. Presented results show that bagged ensembles of SVM are more reliable and achieve equal or better classification accuracy with respect to single SVM, whereas feature selection methods can further enhance classification accuracy.
  • Keywords
    cancer; data analysis; learning (artificial intelligence); medical diagnostic computing; pattern classification; support vector machines; DNA microarray data; SVM; deoxyribonucleic acid; ensemble methods; feature selection algorithms; gene expression data analysis; information extraction; malignant tissues classification; resampling techniques; support vector machines; variance problems; Bagging; Cancer; DNA; Data analysis; Gene expression; Machine learning; Performance evaluation; Support vector machine classification; Support vector machines; Telecommunications;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 2003. Proceedings of the International Joint Conference on
  • ISSN
    1098-7576
  • Print_ISBN
    0-7803-7898-9
  • Type

    conf

  • DOI
    10.1109/IJCNN.2003.1223688
  • Filename
    1223688