Title :
Information cut and information forces for clustering
Author :
Jenssen, Robert ; Principe, Jose C. ; Eltoft, Torbjørn
Author_Institution :
Comput. NeuroEngineering Lab., Florida Univ., Gainesville, FL, USA
Abstract :
We define an information-theoretic divergence measure between probability density functions (pdfs) that has a deep connection to the cut in graph-theory. This connection is revealed when the pdfs are estimated by the Parzen method with a Gaussian kernel. We refer to our divergence measure as the information cut. The information cut provides us with a theoretically sound criterion for cluster evaluation. In this paper we show that it can be used to merge clusters. The initial clusters are obtained based on the related concept of information forces. We create directed trees by selecting the predecessor of a node (pattern) according to the direction of the information force acting on the pattern. Each directed tree corresponds to a cluster, hence enabling us to obtain an initial partitioning of the data set. Subsequently, we utilize the information cut as a cluster evaluation function to merge clusters until the predefined number of clusters is reached. We demonstrate the performance of our novel information-theoretic clustering method when applied to both artificially created data and real data, with encouraging results.
Keywords :
data analysis; graph theory; information theory; pattern clustering; Gaussian kernel; Parzen method; cluster evaluation function; divergence measure; graph theory; information cut; information forces; information-theoretic clustering method; probability density functions; Clustering algorithms; Clustering methods; Cost function; Data structures; Density measurement; Entropy; Kernel; Laboratories; Neural engineering; Probability density function;
Conference_Titel :
Neural Networks for Signal Processing, 2003. NNSP'03. 2003 IEEE 13th Workshop on
Print_ISBN :
0-7803-8177-7
DOI :
10.1109/NNSP.2003.1318045