Title :
The optimal hyperparameter for Bayesian clustering and its application to the evaluation of clustering results
Author :
Yamazaki, Keisuke
Author_Institution :
Dept. of Comput. Intell. & Syst. Sci., Tokyo Inst. of Technol., Yokohama, Japan
Abstract :
In a probabilistic approach to cluster analysis, parametric models, such as a mixture of Gaussian distributions, are often used. Since the parameter is unknown, it is necessary to estimate both the parameter and the labels of the clusters. Recently, the statistical properties of Bayesian clustering have been studied. The theoretical accuracy has been analyzed, and it has been found to be better than the maximum-likelihood method, which is based on the expectation-maximization algorithm. However, the effect of a prior distribution on the clustering result remains unknown. In the present paper, we theoretically and experimentally investigate the behavior of the optimal hyperparameter, which is the parameter of the prior distribution, and we propose an evaluation method for the clustering result, based on the prior optimization.
Keywords :
Bayes methods; Gaussian distribution; expectation-maximisation algorithm; optimisation; pattern clustering; Bayesian clustering; Gaussian distribution; cluster analysis; expectation-maximization algorithm; maximum-likelihood method; optimal hyperparameter; parameter estimation; parametric model; prior distribution; prior optimization; probabilistic approach; statistical property; Accuracy; Bayes methods; Gaussian distribution; Maximum likelihood estimation; Optimization; Probabilistic logic; Shape;
Conference_Titel :
Soft Computing and Intelligent Systems (SCIS), 2014 Joint 7th International Conference on and Advanced Intelligent Systems (ISIS), 15th International Symposium on
DOI :
10.1109/SCIS-ISIS.2014.7044654