DocumentCode
457263
Title
Combining Generative and Discriminative Methods for Pixel Classification with Multi-Conditional Learning
Author
Kelm, B. Michael ; Pal, Chris ; McCallum, Andrew
Author_Institution
Interdisciplinary Center for Sci. Comput., Heidelberg Univ.
Volume
2
fYear
0
fDate
0-0 0
Firstpage
828
Lastpage
832
Abstract
It is possible to broadly characterize two approaches to probabilistic modeling in terms of generative and discriminative methods. Provided with sufficient training data the discriminative approach is expected to yield superior accuracy as compared to the analogous generative model since no modeling power is expended on the marginal distribution of the features. Conversely, if the model is accurate the generative approach can perform better with less data. In general it is less vulnerable to overfitting and allows one to more easily specify meaningful priors on the model parameters. We investigate multi-conditional learning - a method combining the merits of both approaches. Through specifying a joint distribution over classes and features we derive a family of models with analogous parameters. Parameter estimates are found by optimizing an objective function consisting of a weighted combination of conditional log-likelihoods. Systematic experiments in the context of foreground/background pixel classification with the Microsoft-Berkeley segmentation database using mixtures of factor analyzers illustrate tradeoffs between classifier complexity, the amount of training data and generalization accuracy. We show experimentally that this approach can lead to models with better generalization performance than purely generative or discriminative approaches
Keywords
image classification; learning (artificial intelligence); probability; Microsoft-Berkeley segmentation database; analogous parameters; background pixel classification; classifier complexity; conditional log-likelihoods; discriminative methods; foreground pixel classification; generative methods; joint distribution; multiconditional learning; objective function; parameter estimates; probabilistic modeling; Character generation; Computer science; Context modeling; Linear discriminant analysis; Logistics; Parameter estimation; Power generation; Power system modeling; Scientific computing; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 2006. ICPR 2006. 18th International Conference on
Conference_Location
Hong Kong
ISSN
1051-4651
Print_ISBN
0-7695-2521-0
Type
conf
DOI
10.1109/ICPR.2006.384
Filename
1699333
Link To Document