DocumentCode :
3239631
Title :
Information cut and information forces for clustering
Author :
Jenssen, Robert ; Principe, Jose C. ; Eltoft, Torbjørn
Author_Institution :
Comput. NeuroEngineering Lab., Florida Univ., Gainesville, FL, USA
fYear :
2003
fDate :
17-19 Sept. 2003
Firstpage :
459
Lastpage :
468
Abstract :
We define an information-theoretic divergence measure between probability density functions (pdfs) that has a deep connection to the cut in graph-theory. This connection is revealed when the pdfs are estimated by the Parzen method with a Gaussian kernel. We refer to our divergence measure as the information cut. The information cut provides us with a theoretically sound criterion for cluster evaluation. In this paper we show that it can be used to merge clusters. The initial clusters are obtained based on the related concept of information forces. We create directed trees by selecting the predecessor of a node (pattern) according to the direction of the information force acting on the pattern. Each directed tree corresponds to a cluster, hence enabling us to obtain an initial partitioning of the data set. Subsequently, we utilize the information cut as a cluster evaluation function to merge clusters until the predefined number of clusters is reached. We demonstrate the performance of our novel information-theoretic clustering method when applied to both artificially created data and real data, with encouraging results.
Keywords :
data analysis; graph theory; information theory; pattern clustering; Gaussian kernel; Parzen method; cluster evaluation function; divergence measure; graph theory; information cut; information forces; information-theoretic clustering method; probability density functions; Clustering algorithms; Clustering methods; Cost function; Data structures; Density measurement; Entropy; Kernel; Laboratories; Neural engineering; Probability density function;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks for Signal Processing, 2003. NNSP'03. 2003 IEEE 13th Workshop on
ISSN :
1089-3555
Print_ISBN :
0-7803-8177-7
Type :
conf
DOI :
10.1109/NNSP.2003.1318045
Filename :
1318045
Link To Document :
بازگشت