DocumentCode :
915785
Title :
Learning shape-classes using a mixture of tree-unions
Author :
Torsello, Andrea ; Hancock, Edwin R.
Author_Institution :
Dipartimento di Inf., Univ. Ca´´Foscari di Venezia, Torino, Italy
Volume :
28
Issue :
6
fYear :
2006
fDate :
6/1/2006 12:00:00 AM
Firstpage :
954
Lastpage :
967
Abstract :
This paper poses the problem of tree-clustering as that of fitting a mixture of tree unions to a set of sample trees. The tree-unions are structures from which the individual data samples belonging to a cluster can be obtained by edit operations. The distribution of observed tree nodes in each cluster sample is assumed to be governed by a Bernoulli distribution. The clustering method is designed to operate when the correspondences between nodes are unknown and must be inferred as part of the learning process. We adopt a minimum description length approach to the problem of fitting the mixture model to data. We make maximum-likelihood estimates of the Bernoulli parameters. The tree-unions and the mixing proportions are sought so as to minimize the description length criterion. This is the sum of the negative logarithm of the Bernoulli distribution, and a message-length criterion that encodes both the complexity of the union-trees and the number of mixture components. We locate node correspondences by minimizing the edit distance with the current tree unions, and show that the edit distance is linked to the description length criterion. The method can be applied to both unweighted and weighted trees. We illustrate the utility of the resulting algorithm on the problem of classifying 20 shapes using a shock graph representation.
Keywords :
learning (artificial intelligence); maximum likelihood estimation; pattern clustering; statistical distributions; trees (mathematics); Bernoulli distribution; cluster sample; data samples; description length criterion; edit distance; edit operations; maximum-likelihood estimation; message-length criterion; minimum description length; mixing proportions; negative logarithm; observed tree nodes distribution; sample trees; shape-class learning; shock graph representation; tree-clustering problem; tree-union mixture; weighted trees; Clustering algorithms; Clustering methods; Computer vision; Design methodology; Electric shock; Maximum likelihood estimation; Shape; Skeleton; Statistics; Tree graphs; Structural learning; minimum description length; mixture modelinq; model codes; shock graphs.; tree clustering; Algorithms; Artificial Intelligence; Image Interpretation, Computer-Assisted; Information Storage and Retrieval; Pattern Recognition, Automated;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2006.125
Filename :
1624359
Link To Document :
بازگشت