DocumentCode
39932
Title
Jeffreys Centroids: A Closed-Form Expression for Positive Histograms and a Guaranteed Tight Approximation for Frequency Histograms
Author
Nielsen, Frank
Author_Institution
Sony Comput. Sci. Labs., Inc., Tokyo, Japan
Volume
20
Issue
7
fYear
2013
fDate
Jul-13
Firstpage
657
Lastpage
660
Abstract
Due to the success of the bag-of-word modeling paradigm, clustering histograms has become an important ingredient of modern information processing. Clustering histograms can be performed using the celebrated k-means centroid-based algorithm. From the viewpoint of applications, it is usually required to deal with symmetric distances. In this letter, we consider the Jeffreys divergence that symmetrizes the Kullback-Leibler divergence, and investigate the computation of Jeffreys centroids. We first prove that the Jeffreys centroid can be expressed analytically using the Lambert W function for positive histograms. We then show how to obtain a fast guaranteed approximation when dealing with frequency histograms. Finally, we conclude with some remarks on the k-means histogram clustering.
Keywords
approximation theory; document handling; pattern classification; pattern clustering; Jeffreys centroids; Kullback-Leibler divergence; Lambert W function; bag-of-word modeling paradigm; closed-form expression; document classification; frequency histograms; guaranteed tight approximation; histograms clustering; k-means centroid-based algorithm; k-means histogram clustering; positive histograms; Approximation algorithms; Approximation methods; Clustering algorithms; Databases; Histograms; Signal processing algorithms; Visualization; Centroid; Jeffreys divergence; Kullback–Leibler divergence; Lambert $W$ function; clustering; histogram;
fLanguage
English
Journal_Title
Signal Processing Letters, IEEE
Publisher
ieee
ISSN
1070-9908
Type
jour
DOI
10.1109/LSP.2013.2260538
Filename
6509930
Link To Document