DocumentCode :
28600
Title :
Knowledge Fusion for Probabilistic Generative Classifiers with Data Mining Applications
Author :
Fisch, Dominik ; Kalkowski, Edgar ; Sick, Bernhard
Author_Institution :
Intell. Embedded Syst. Lab., Univ. of Kassel, Kassel, Germany
Volume :
26
Issue :
3
fYear :
2014
fDate :
Mar-14
Firstpage :
652
Lastpage :
666
Abstract :
If knowledge such as classification rules are extracted from sample data in a distributed way, it may be necessary to combine or fuse these rules. In a conventional approach this would typically be done either by combining the classifiers´ outputs (e.g., in form of a classifier ensemble) or by combining the sets of classification rules (e.g., by weighting them individually). In this paper, we introduce a new way of fusing classifiers at the level of parameters of classification rules. This technique is based on the use of probabilistic generative classifiers using multinomial distributions for categorical input dimensions and multivariate normal distributions for the continuous ones. That means, we have distributions such as Dirichlet or normal-Wishart distributions over parameters of the classifier. We refer to these distributions as hyperdistributions or second-order distributions. We show that fusing two (or more) classifiers can be done by multiplying the hyperdistributions of the parameters and derive simple formulas for that task. Properties of this new approach are demonstrated with a few experiments. The main advantage of this fusion approach is that the hyperdistributions are retained throughout the fusion process. Thus, the fused components may, for example, be used in subsequent training steps (online training).
Keywords :
data mining; normal distribution; pattern classification; sensor fusion; Dirichlet distribution; categorical input dimensions; classification rules; classifier fusion; classifier outputs; data mining applications; hyperdistributions; knowledge fusion; multinomial distributions; multivariate normal distributions; normal-Wishart distribution; probabilistic generative classifier; second-order distributions; Bayesian methods; Coordinate measuring machines; Covariance matrix; Data mining; Knowledge engineering; Probabilistic logic; Training; Bayesian techniques; Knowledge fusion; classifier fusion; data mining; generative classifier; probabilistic classifier;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2013.20
Filename :
6420835
Link To Document :
بازگشت