Title :
Likelihood based hierarchical clustering
Author :
Castro, Rui M. ; Coates, Mark J. ; Nowak, Robert D.
Author_Institution :
Dept. of Electr. & Comput. Eng., Rice Univ., Houston, TX, USA
Abstract :
This paper develops a new method for hierarchical clustering. Unlike other existing clustering schemes, our method is based on a generative, tree-structured model that represents relationships between the objects to be clustered, rather than directly modeling properties of objects themselves. In certain problems, this generative model naturally captures the physical mechanisms responsible for relationships among objects, for example, in certain evolutionary tree problems in genetics and communication network topology identification. The paper examines the networking problem in some detail to illustrate the new clustering method. More broadly, the generative model may not reflect actual physical mechanisms, but it nonetheless provides a means for dealing with errors in the similarity matrix, simultaneously promoting two desirable features in clustering: intraclass similarity and interclass dissimilarity.
Keywords :
Markov processes; Monte Carlo methods; maximum likelihood estimation; network topology; pattern clustering; telecommunication networks; trees (mathematics); Markov chain Monte Carlo methods; bottom-up agglomerative approach; communication network topology identification; generative model; interclass dissimilarity; intraclass similarity; likelihood based hierarchical clustering; maximum penalized likelihood estimation; model-based clustering; tree-structured model; Algorithm design and analysis; Clustering algorithms; Clustering methods; Communication networks; Covariance matrix; Genetics; Measurement errors; Network topology; Process design; Proteins; Markov Chain Monte Carlo methods; model-based clustering; network topology identification; tree models;
Journal_Title :
Signal Processing, IEEE Transactions on
DOI :
10.1109/TSP.2004.831124