Title :
Pairwise data clustering by deterministic annealing
Author :
Hofmann, Thomas ; Buhmann, Joachim M.
Author_Institution :
Inst. fur Inf. II, Rheinrich-Wilhelms Univ., Bonn, Germany
fDate :
1/1/1997 12:00:00 AM
Abstract :
Partitioning a data set and extracting hidden structure from the data arises in different application areas of pattern recognition, speech and image processing. Pairwise data clustering is a combinatorial optimization method for data grouping which extracts hidden structure from proximity data. We describe a deterministic annealing approach to pairwise clustering which shares the robustness properties of maximum entropy inference. The resulting Gibbs probability distributions are estimated by mean-field approximation. A new structure-preserving algorithm to cluster dissimilarity data and to simultaneously embed these data in a Euclidian vector space is discussed which can be used for dimensionality reduction and data visualization. The suggested embedding algorithm which outperforms conventional approaches has been implemented to analyze dissimilarity data from protein analysis and from linguistics. The algorithm for pairwise data clustering is used to segment textured images
Keywords :
image processing; inference mechanisms; maximum entropy methods; pattern recognition; probability; simulated annealing; speech processing; Euclidian vector space; Gibbs probability distributions; combinatorial optimization method; data grouping; data set partitioning; data visualization; deterministic annealing; dimensionality reduction; dissimilarity data clustering; hidden structure extraction; image processing; linguistics; maximum entropy inference; mean-field approximation; pairwise data clustering; pattern recognition; protein analysis; robustness; speech processing; structure-preserving algorithm; textured image segmentation; Algorithm design and analysis; Annealing; Clustering algorithms; Data analysis; Data mining; Image processing; Inference algorithms; Optimization methods; Pattern recognition; Speech processing;
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on