• DocumentCode
    3239396
  • Title

    Effect of separate sampling on classification and the minimax criterion

  • Author

    Shahrokh Esfahani, Mohammad ; Dougherty, Edward

  • Author_Institution
    Dept. of Electr. Eng., Texas A&M Univ., College Station, TX, USA
  • fYear
    2013
  • fDate
    17-19 Nov. 2013
  • Firstpage
    72
  • Lastpage
    73
  • Abstract
    It is commonplace in bioinformatics (and elsewhere) to build a classifier from sample data in which the sample sizes of the classes are not random; that is, they are selected prior to sampling. The result is that there is no estimate of the prior class probabilities available from the data. In this paper, we find an analytic result for the minimax solution for the class prior probabilities for a general Neyman-Pearson induced classifier. From that we derive Anderson´s classical minimax prior probability “estimate.” Using synthetic and real data, we demonstrate the degradation in classifier performance from using inaccurate values for the prior probabilities.
  • Keywords
    bioinformatics; minimax techniques; pattern classification; probability; sampling methods; Anderson classical minimax prior probability; bioinformatics; class probabilities; classifier performance; general Neyman-Pearson induced classifier; minimax criterion; minimax solution; sampling method; Bioinformatics; Computational modeling; Covariance matrices; Error analysis; Sociology; Tin; Separate sampling; classification accuracy; minimax;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Genomic Signal Processing and Statistics (GENSIPS), 2013 IEEE International Workshop on
  • Conference_Location
    Houston, TX
  • Print_ISBN
    978-1-4799-3461-4
  • Type

    conf

  • DOI
    10.1109/GENSIPS.2013.6735935
  • Filename
    6735935