• DocumentCode
    3252036
  • Title

    Optimal Bayesian feature selection

  • Author

    Dalton, Lori

  • Author_Institution
    Dept. of Biomed. Inf., Ohio State Univ., Columbus, OH, USA
  • fYear
    2013
  • fDate
    3-5 Dec. 2013
  • Firstpage
    65
  • Lastpage
    68
  • Abstract
    Biomarker discovery and classification in medical applications both typically involve feature selection applied to a small-sample high-dimensional dataset. Recent work has proposed a framework to integrate a prior over an uncertainty class of parameterized feature-label distributions with training data to obtain optimal classifiers, MMSE classifier error estimates, and evaluate the MSE of error estimates. However, feature selection has not been investigated rigorously in this paradigm. In the present work, we begin to address optimal feature selection in a Bayesian framework via a sparsity inducing prior that assumes the number of “good” features is small. From modeling assumptions and this prior we derive expressions for the sample-conditioned probability mass over good feature sets. It thus becomes possible to find feature sets that are optimal relative to maximal posterior probability. Furthermore, one may provide this probability along with a given feature set, and thereby evaluate the validity and reliability of the results.
  • Keywords
    Bayes methods; data handling; medical computing; Bayesian framework; biomarker discovery; medical applications; optimal Bayesian feature selection; Approximation methods; Bayes methods; Bioinformatics; Error analysis; Training; Uncertainty; Vectors; Bayesian modeling; Feature selection; biomarker discovery;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Global Conference on Signal and Information Processing (GlobalSIP), 2013 IEEE
  • Conference_Location
    Austin, TX
  • Type

    conf

  • DOI
    10.1109/GlobalSIP.2013.6736814
  • Filename
    6736814