Title :
Regularized discriminative clustering
Author :
Kaski, Samuel ; Sinkkonen, Janne ; Klami, Arto
Author_Institution :
Neural Network Res. Centre, Helsinki Univ. of Technol., Finland
Abstract :
A generative distributional clustering model for continuous data is reviewed and methods for optimizing and regularizing it are introduced and compared. Based on pairs of auxiliary and primary data, the primary data space is partitioned into Voronoi regions that are maximally homogeneous in terms of auxiliary data. Then only variation in the primary data associated with variation in the auxiliary data influences the clusters. Because the whole primary space is partitioned, new samples can be easily clustered in terms of primary data alone. In experiments, the approach is shown to produce more homogeneous clusters than alternative methods. Two regularization methods are demonstrated to further improve the results: an entropy-type penalty for unequal cluster sizes, and the inclusion of a K-means component to the model. The latter can alternatively be interpreted as special kind of joint distribution modeling where the emphasis between discrimination and unsupervised modeling of primary data can be tuned.
Keywords :
conjugate gradient methods; statistical analysis; statistical distributions; K-means component; Voronoi regions; auxiliary data; distribution modeling; entropy-type penalty; generative distributional clustering model; primary data space; regularized discriminative clustering; Bayesian methods; Clustering algorithms; Data mining; Neural networks; Optimization methods; Partitioning algorithms; Predictive models; Prototypes; Smoothing methods; Space technology;
Conference_Titel :
Neural Networks for Signal Processing, 2003. NNSP'03. 2003 IEEE 13th Workshop on
Print_ISBN :
0-7803-8177-7
DOI :
10.1109/NNSP.2003.1318028