Title :
Sided and Symmetrized Bregman Centroids
Author :
Nielsen, Frank ; Nock, Richard
Author_Institution :
Dept. of Fundamental Res., Sony Comput. Sci. Labs., Inc., Tokyo
fDate :
6/1/2009 12:00:00 AM
Abstract :
In this paper, we generalize the notions of centroids (and barycenters) to the broad class of information-theoretic distortion measures called Bregman divergences. Bregman divergences form a rich and versatile family of distances that unifies quadratic Euclidean distances with various well-known statistical entropic measures. Since besides the squared Euclidean distance, Bregman divergences are asymmetric, we consider the left-sided and right-sided centroids and the symmetrized centroids as minimizers of average Bregman distortions. We prove that all three centroids are unique and give closed-form solutions for the sided centroids that are generalized means. Furthermore, we design a provably fast and efficient arbitrary close approximation algorithm for the symmetrized centroid based on its exact geometric characterization. The geometric approximation algorithm requires only to walk on a geodesic linking the two left/right-sided centroids. We report on our implementation for computing entropic centers of image histogram clusters and entropic centers of multivariate normal distributions that are useful operations for processing multimedia information and retrieval. These experiments illustrate that our generic methods compare favorably with former limited ad hoc methods.
Keywords :
content-based retrieval; feature extraction; image retrieval; multimedia communication; Bregman divergences; ad hoc methods; approximation algorithm; content-based multimedia retrieval applications; feature extractions; image retrieval; information-theoretic distortion; left sided centroids; multimedia information processing; quadratic Euclidean distances; right-sided centroids; statistical entropic measures; symmetrized Bregman centroids; Algorithm design and analysis; Approximation algorithms; Closed-form solution; Clustering algorithms; Distortion measurement; Distributed computing; Euclidean distance; Geophysics computing; Histograms; Joining processes; Bregman divergence; Bregman information; Bregman power divergence; Burbea–Rao divergence; CsiszÁr $f$ -divergences; Kullback–Leibler divergence; Legendre duality; centroid; information geometry; information radius;
Journal_Title :
Information Theory, IEEE Transactions on
DOI :
10.1109/TIT.2009.2018176