• DocumentCode
    667309
  • Title
    Ensemble learning and hierarchical data representation for microarray classification
  • Author
    Bosio, Mattia ; Bellot, Pau ; Salembier, Philippe ; Oliveras Vergeas, Albert
  • Author_Institution
    Dept. of Signal Theor. & Commun., Tech. Univ. of Catalonia UPC, Barcelona, Spain
  • fYear
    2013
  • fDate
    10-13 Nov. 2013
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Microarray data classification is an open and active research field. The development of more accurate algorithms is of great interest, and many of the developed techniques can be straightforwardly applied to the analysis of different kinds of omics data. In this work, an ensemble learning algorithm is applied within a classification framework that has already obtained good predictive results. Ensemble techniques take individual experts (i.e., classifiers) and combine them with a voting scheme to improve on the individual expert results. In this case, a thinning algorithm is proposed which starts by using all the available experts and removes them one by one, focusing on improving the ensemble vote. Two versions of a state-of-the-art ensemble thinning algorithm have been tested, and three key elements have been introduced to work with microarray data: the ensemble cohort definition; the nonexpert notion, which defines a set of experts excluded from the thinning process; and a rule to break ties in the thinning process. Experiments have been done on seven public datasets from the Microarray Quality Control (MAQC) study. The proposed key elements have been shown to be useful for prediction performance, and the studied ensemble technique has been shown to improve on the state-of-the-art results by producing classifiers with better predictions.
  • Keywords
    biology computing; classification; data handling; lab-on-a-chip; learning (artificial intelligence); quality control; MAQC; active research field; classification framework; classifiers; ensemble cohort definition; ensemble learning algorithm; ensemble thinning algorithm; ensemble vote; hierarchical data representation; microarray data classification; microarray quality control; nonexpert notion; omics data; public datasets; thinning process; Accuracy; Gene expression; Merging; Prediction algorithms; Protocols; Quality control; Training;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Bioinformatics and Bioengineering (BIBE), 2013 IEEE 13th International Conference on
  • Conference_Location
    Chania
  • Type
    conf
  • DOI
    10.1109/BIBE.2013.6701647
  • Filename
    6701647