DocumentCode
3239631
Title
Information cut and information forces for clustering
Author
Jenssen, Robert ; Principe, Jose C. ; Eltoft, Torbjørn
Author_Institution
Comput. NeuroEngineering Lab., Florida Univ., Gainesville, FL, USA
fYear
2003
fDate
17-19 Sept. 2003
Firstpage
459
Lastpage
468
Abstract
We define an information-theoretic divergence measure between probability density functions (pdfs) that has a deep connection to the cut in graph-theory. This connection is revealed when the pdfs are estimated by the Parzen method with a Gaussian kernel. We refer to our divergence measure as the information cut. The information cut provides us with a theoretically sound criterion for cluster evaluation. In this paper we show that it can be used to merge clusters. The initial clusters are obtained based on the related concept of information forces. We create directed trees by selecting the predecessor of a node (pattern) according to the direction of the information force acting on the pattern. Each directed tree corresponds to a cluster, hence enabling us to obtain an initial partitioning of the data set. Subsequently, we utilize the information cut as a cluster evaluation function to merge clusters until the predefined number of clusters is reached. We demonstrate the performance of our novel information-theoretic clustering method when applied to both artificially created data and real data, with encouraging results.
Keywords
data analysis; graph theory; information theory; pattern clustering; Gaussian kernel; Parzen method; cluster evaluation function; divergence measure; graph theory; information cut; information forces; information-theoretic clustering method; probability density functions; Clustering algorithms; Clustering methods; Cost function; Data structures; Density measurement; Entropy; Kernel; Laboratories; Neural engineering; Probability density function;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks for Signal Processing, 2003. NNSP'03. 2003 IEEE 13th Workshop on
ISSN
1089-3555
Print_ISBN
0-7803-8177-7
Type
conf
DOI
10.1109/NNSP.2003.1318045
Filename
1318045
Link To Document