Title :
Exact Sample Conditioned MSE Performance of the Bayesian MMSE Estimator for Classification Error—Part I: Representation
Author :
Dalton, Lori A. ; Dougherty, Edward R.
Author_Institution :
Dept. of Electr. & Comput. Eng., Texas A&M Univ., College Station, TX, USA
fDate :
5/1/2012 12:00:00 AM
Abstract :
In recent years, biomedicine has been faced with difficult high-throughput small-sample classification problems. In such settings, classifier error estimation becomes a critical issue because training and testing must be done on the same data. A recently proposed error estimator places the problem in a signal estimation framework in the presence of uncertainty, permitting a rigorous solution optimal in a minimum-mean-square error sense. The uncertainty in this model is relative to the parameters of the feature-label distributions, resulting in a Bayesian approach to error estimation. Closed form solutions are available for two important problems: discrete classification with Dirichlet priors and linear classification of Gaussian distributions with normal-inverse-Wishart priors. In this work, Part I of a two-part study, we introduce the theoretical mean-square-error (MSE) conditioned on the observed sample of any estimate of the classifier error, including the Bayesian error estimator, for both Bayesian models. Thus, Bayesian error estimation has a unique advantage in that its mathematical framework naturally gives rise to a practical expected measure of performance given an observed sample. In Part II of the study we examine consistency of the error estimator, demonstrate various MSE properties, and apply the conditional MSE to censored sampling.
Keywords :
Bayes methods; mean square error methods; medical signal processing; signal classification; signal representation; Bayesian MMSE estimator; Bayesian error estimator; Dirichlet priors; Gaussian distribution linear classification; biomedicine; classification error estimation; closed form solutions; discrete classification; exact sample conditioned MSE performance; feature-label distributions; high-throughput small-sample classification problems; mathematical framework; minimum-mean-square error; normal-inverse-Wishart priors; signal estimation framework; Bayesian methods; Bioinformatics; Error analysis; Joints; Mathematical model; Tin; Uncertainty; Bayesian estimation; classification; error estimation; genomics; minimum mean-square estimation; small samples;
Journal_Title :
Signal Processing, IEEE Transactions on
DOI :
10.1109/TSP.2012.2184101