• DocumentCode
    1431393
  • Title

    A New Measure of Classifier Performance for Gene Expression Data

  • Author

    Hanczar, Blaise ; Bar-Hen, Avner

  • Author_Institution
    LIPADE, Univ. Paris Descartes, Paris, France
  • Volume
    9
  • Issue
    5
  • fYear
    2012
  • Firstpage
    1379
  • Lastpage
    1386
  • Abstract
    One of the major aims of many microarray experiments is to build discriminatory diagnosis and prognosis models. A large number of supervised methods have been proposed in literature for microarray-based classification for this purpose. Model evaluation and comparison is a critical issue and, the most of the time, is based on the classification cost. This classification cost is based on the costs of false positives and false negative, that are generally unknown in diagnostics problems. This uncertainty may highly impact the evaluation and comparison of the classifiers. We propose a new measure of classifier performance that takes account of the uncertainty of the error. We represent the available knowledge about the costs by a distribution function defined on the ratio of the costs. The performance of a classifier is therefore computed over the set of all possible costs weighted by their probability distribution. Our method is tested on both artificial and real microarray data sets. We show that the performance of classifiers is very depending of the ratio of the classification costs. In many cases, the best classifier can be identified by our new measure whereas the classic error measures fail.
  • Keywords
    biology computing; genetics; genomics; lab-on-a-chip; pattern classification; probability; artificial microarray data sets; classifier performance; discriminatory diagnosis; gene expression data; microarray-based classification; probability distribution; prognosis models; real microarray data sets; Bioinformatics; Computational biology; Cost function; Error analysis; Measurement uncertainty; Support vector machines; Training; Classifier performance; gene expression.; microarray analysis; supervised classification; Gene Expression; Gene Expression Profiling; Models, Theoretical; Oligonucleotide Array Sequence Analysis; Pattern Recognition, Automated;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2012.21
  • Filename
    6138847