DocumentCode
46478
Title
Pitman Yor Diffusion Trees for Bayesian Hierarchical Clustering
Author
Knowles, David A. ; Ghahramani, Zoubin
Author_Institution
, Stanford University, Menlo Park, California
Volume
37
Issue
2
fYear
2015
fDate
Feb. 1 2015
Firstpage
271
Lastpage
289
Abstract
In this paper we introduce the Pitman Yor Diffusion Tree (PYDT), a Bayesian non-parametric prior over tree structures which generalises the Dirichlet Diffusion Tree [30] and removes the restriction to binary branching structure. The generative process is described and shown to result in an exchangeable distribution over data points. We prove some theoretical properties of the model including showing its construction as the continuum limit of a nested Chinese restaurant process model. We then present two alternative MCMC samplers which allow us to model uncertainty over tree structures, and a computationally efficient greedy Bayesian EM search algorithm. Both algorithms use message passing on the tree structure. The utility of the model and algorithms is demonstrated on synthetic and real world data, both continuous and binary.
Keywords
Bayes methods; Computational modeling; Data models; Equations; Hidden Markov models; TV; Vegetation; Machine learning; clustering methods; density estimation; phylogeny; robust algorithm; unsupervised learning;
fLanguage
English
Journal_Title
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher
ieee
ISSN
0162-8828
Type
jour
DOI
10.1109/TPAMI.2014.2313115
Filename
6777276
Link To Document