• DocumentCode
    1667647
  • Title

    Classification with minimum expected error over an uncertainty class of Gaussian distributions

  • Author

    Dalton, Lori

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Ohio State Univ., Columbus, OH, USA
  • fYear
    2013
  • Firstpage
    3462
  • Lastpage
    3466
  • Abstract
    In biomedicine, it is typical to find studies discriminating between types of pathology or stages of disease based on as few as 30 sample points and tens of thousands of genes. Unfortunately, out-of-the-box classification and error estimation rules come with no small-sample performance guarantees, which has greatly contributed to the crisis in biomarker reproducibility. Recent work addresses this by supplementing the data with expert biological knowledge via a prior distribution over an uncertainty class of feature-label distributions, and uses the resulting probabilistic framework to define minimum mean-square-error (MMSE) estimators for the misclassification rate of any fixed classifier, as well as the sample-conditioned MSE itself for arbitrary error estimators. Here, we use the same framework to also define minimum expected error (MEE) classifiers, completing a Bayesian optimized theory of classification. We also present examples on real genomic data resulting in classifiers that greatly outperform popular rules.
  • Keywords
    Bayes methods; Gaussian distribution; biology computing; diseases; genomics; least mean squares methods; pattern classification; uncertainty handling; Bayesian optimized theory; Gaussian distributions; MEE classifiers; MMSE estimators; biomarker reproducibility; biomedicine; disease; error estimation rules; expert biological knowledge; feature-label distributions; fixed classifier; genomic data; minimum expected error classifiers; minimum mean square error estimators; misclassification rate; out-of-the-box classification; pathology; probabilistic framework; sample-conditioned MSE; uncertainty class; Bayes methods; Bioinformatics; Error analysis; Genomics; Tin; Training; Uncertainty; Bayesian modeling; classification; error estimation; genomics; small samples;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6638301
  • Filename
    6638301