Title :
On the structure of strict sense Bayesian cost functions and its applications
Author :
Cid-Sueiro, Jesús ; Figueiras-Vidal, Aníbal R.
Author_Institution :
Dept. de Tecnologias de las Comunicaciones, Univ. Carlos III, Madrid, Spain
fDate :
5/1/2001 12:00:00 AM
Abstract :
In the context of classification problems, the paper analyzes the general structure of the strict sense Bayesian (SSB) cost functions, those having a unique minimum when the soft decisions are equal to the posterior class probabilities. We show that any SSB cost is essentially the sum of a generalized measure of entropy, which does not depend on the targets, and an error component. Symmetric cost functions are analyzed in detail. Our results provide a further insight on the behavior of this family of objective functions and are the starting point for the exploration of novel algorithms. Two applications are proposed. First, the use of asymmetric SSB cost functions for posterior probability estimation in non-maximum a posteriori (MAP) decision problems. Second, a novel entropy minimization principle for hybrid learning: use labeled data to minimize the cost function, and unlabeled data to minimize the corresponding entropy measure
Keywords :
Bayes methods; decision theory; entropy; estimation theory; learning (artificial intelligence); minimisation; neural nets; probability; classification problems; entropy minimization principle; generalized entropy measure; hybrid learning; labeled data; nonmaximum a posteriori decision problems; objective functions; posterior class probabilities; posterior probability estimation; soft decisions; strict sense Bayesian cost functions; symmetric cost functions; unique minimum; unlabeled data; Amplitude modulation; Bayesian methods; Cost function; Data analysis; Entropy; Error probability; Machine learning; Neural networks; Object detection; Training data;
Journal_Title :
Neural Networks, IEEE Transactions on