Title :
Transductive Methods for Distributed Ensemble Classification
Author :
Miller, David J. ; Pal, Siddharth
Author_Institution :
Dept. of Electr. Eng., Pennsylvania State Univ., University Park, PA
Abstract :
We consider ensemble classification for the case when there is no common labeled training data for designing the function which aggregates individual classifier decisions. We dub this problem distributed ensemble classification, addressing e.g. when individual classifiers are trained on different (e.g. proprietary, legacy) databases or operate (perhaps remotely) on different sensing modalities. Typically, fixed, principled (untrained) rules of classifier combination such as voting methods are used in this case for aggregating decisions. Alternatively, we take a transductive approach, optimizing the combining rule for an objective function measured on the un labeled batch of test data. We propose specific maximum likelihood (ML) objectives that are shown to yield well-known forms of aggregation, albeit with iterative, EM-based adjustment to account for possible mismatch between the class priors used by individual classifiers and those reflected in the new data batch. We also propose an information-theoretic method which outperforms the ML methods and addresses some problem instances where the ML methods are not applicable. On benchmark data from the UC Irvine machine learning repository, all our methods give improvements in accuracy over the use of fixed rules when there is prior mismatch.
Keywords :
learning (artificial intelligence); maximum likelihood estimation; pattern classification; Irvine machine learning repository; distributed ensemble classification; information-theoretic method; maximum likelihood objective; objective function; transductive approach; Aggregates; Algorithm design and analysis; Design optimization; Distributed databases; Machine learning; Spatial databases; Supervised learning; Testing; Training data; Voting; EM algorithm; decision aggregation; ensemble classification; transductive learning;
Conference_Titel :
Information Sciences and Systems, 2006 40th Annual Conference on
Conference_Location :
Princeton, NJ
Print_ISBN :
1-4244-0349-9
Electronic_ISBN :
1-4244-0350-2
DOI :
10.1109/CISS.2006.286392