• Title of article

    Classification from microarray data using probabilistic discriminant partial least squares with reject option

  • Author/Authors

    Botella، نويسنده , , Cristina and Ferré، نويسنده , , Joan and Boqué، نويسنده , , Ricard، نويسنده ,

  • Issue Information
    ماهنامه با شماره پیاپی سال 2009
  • Pages
    8
  • From page
    321
  • To page
    328
  • Abstract
    Microarrays are used to simultaneously determine the expressions of thousands of genes. An important application of microarrays is in the classification of samples into classes of interest (e.g. either healthy cells or tumour cells). Discriminant partial least squares (DPLS) has often been used for this purpose. In this paper, we describe an improvement to DPLS that uses kernel-based probability density functions and the Bayes rule to classify samples whilst keeping the option of not classifying the sample if this cannot be done with sufficient confidence. With this approach, those samples outside the boundaries of the known classes or from the ambiguity region between classes are rejected and only samples with a high probability of being correctly classified are indeed classified. The optimal model is found by simultaneously minimizing the misclassification and rejection costs. The method (p-DPLS with reject option) was tested with two datasets. For the human cancers dataset the accuracy (obtained by leave-one-out cross-validation) was improved from 97% to 99% when compared to p-DPLS without reject option. For the breast cancer dataset, p-DPLS with reject option was able to reject 100% of the test samples that did not belong to any of the modelled classes. These samples would have been misclassified if the reject option had not been considered.
  • Keywords
    Reject option , DPLS , Microarrays classification
  • Journal title
    Talanta
  • Serial Year
    2009
  • Journal title
    Talanta
  • Record number

    1658841