Title :
A risk-based approach to optimal clustering under random labeled point processes
Author_Institution :
Department of Electrical and Computer Engineering and Department of Biomedical Informatics, The Ohio State University, Columbus, OH, USA
Abstract :
Typically, optimization in clustering is relative to a heuristic metric, rather than relative to a definition of error with respect to a probabilistic model to make clustering rigorously predictive. To address this, we develop a general risk- based formulation for clustering that parallels classical Bayes decision theory for classification, transforming clustering from a subjective activity to an objective operation. We develop a general analytic procedure to find an optimal clustering operator, called a Bayes clusterer, which corresponds to the Bayes classifier in classification theory. In particular, we address Gaussian models, and discuss fundamental limits of performance in clustering.
Keywords :
"Clustering algorithms","Partitioning algorithms","Gaussian mixture model","Optimization","Couplings","Probabilistic logic"
Conference_Titel :
Signals, Systems and Computers, 2015 49th Asilomar Conference on
Electronic_ISBN :
1058-6393
DOI :
10.1109/ACSSC.2015.7421160