Title :
Statistical estimation for Single Linkage Hierarchical Clustering
Author :
Dekang Zhu;Dan Guralnik;Xuezhi Wang;Xiang Li;Bill Moran
Author_Institution :
School of Electronic Science and Engineering, National University of Defense Technology, Changsha, China
fDate :
6/1/2015 12:00:00 AM
Abstract :
Clustering is a key component of most detectors of cyber-attacks, and increasingly, for both theoretical and practical reasons, methods that produce hierarchical clusterings (dendrograms) are being deployed in this context. In particular Single Linkage Hierarchical Clustering (SLHC) is attracting considerable interest. Existing clustering algorithms take no account of uncertainties in the data. In this paper, we derive a statistical model for the estimation of dendrograms, taking into account the uncertainty (through noise or corruption) in the distances among data points. We focus on just the estimation of the hierarchy of partitions afforded by the dendrogram, rather than the heights in the latter. The concept of estimating this "dendrogram structure" under SLHC is introduced, and an approximate maximum likelihood estimator (MLE) for the dendrogram structure is described. The proposed concept is demonstrated by a simple Monte Carlo simulation which demonstrates that the proposed MLE method performs better than SLHC in obtaining the correct structure.
Keywords :
"Measurement","Maximum likelihood estimation","Couplings","Clustering algorithms","Monte Carlo methods","Clustering methods"
Conference_Titel :
Cyber Technology in Automation, Control, and Intelligent Systems (CYBER), 2015 IEEE International Conference on
Print_ISBN :
978-1-4799-8728-3
DOI :
10.1109/CYBER.2015.7288035