DocumentCode
78349
Title
A Bayesian Predictive Model for Clustering Data of Mixed Discrete and Continuous Type
Author
Blomstedt, P. ; Jing Tang ; Jie Xiong ; Granlund, C. ; Corander, J.
Author_Institution
Dept. of Inf. & Comput. Sci., Aalto Univ., Helsinki, Finland
Volume
37
Issue
3
fYear
2015
fDate
March 1 2015
Firstpage
489
Lastpage
498
Abstract
Advantages of model-based clustering methods over heuristic alternatives have been widely demonstrated in the literature. Most model-based clustering algorithms assume that the data are either discrete or continuous, possibly allowing both types to be present in separate features. In this paper, we introduce a model-based approach for clustering feature vectors of mixed type, allowing each feature to simultaneously take on both categorical and real values. Such data may be encountered, for instance, in chemical and biological analyses, in the analysis of survey data, as well as in image analysis. Our model is formulated within a Bayesian predictive framework, where clustering solutions correspond to random partitions of the data. Using conjugate analysis, the posterior probability for each possible partition can be determined analytically, enabling the utilization of efficient computational search strategies for finding the posterior optimal partition. The derived model is illustrated using several synthetic and real datasets.
Keywords
Bayes methods; feature selection; pattern clustering; vectors; Bayesian predictive model; conjugate analysis; continuous data; discrete data; feature selection; feature vector; model-based data clustering algorithms; posterior probability; Bayes methods; Clustering methods; Computational modeling; Data models; Educational institutions; Mathematical model; Predictive models; Bayes methods; mixed distributions; predictive models; unsupervised learning;
fLanguage
English
Journal_Title
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher
ieee
ISSN
0162-8828
Type
jour
DOI
10.1109/TPAMI.2014.2359431
Filename
6905809
Link To Document