DocumentCode :
3666715
Title :
Statistical estimation for Single Linkage Hierarchical Clustering
Author :
Dekang Zhu;Dan Guralnik;Xuezhi Wang;Xiang Li;Bill Moran
Author_Institution :
School of Electronic Science and Engineering, National University of Defense Technology, Changsha, China
fYear :
2015
fDate :
6/1/2015 12:00:00 AM
Firstpage :
745
Lastpage :
750
Abstract :
Clustering is a key component of most detectors of cyber-attacks, and increasingly, for both theoretical and practical reasons, methods that produce hierarchical clusterings (dendrograms) are being deployed in this context. In particular Single Linkage Hierarchical Clustering (SLHC) is attracting considerable interest. Existing clustering algorithms take no account of uncertainties in the data. In this paper, we derive a statistical model for the estimation of dendrograms, taking into account the uncertainty (through noise or corruption) in the distances among data points. We focus on just the estimation of the hierarchy of partitions afforded by the dendrogram, rather than the heights in the latter. The concept of estimating this "dendrogram structure" under SLHC is introduced, and an approximate maximum likelihood estimator (MLE) for the dendrogram structure is described. The proposed concept is demonstrated by a simple Monte Carlo simulation which demonstrates that the proposed MLE method performs better than SLHC in obtaining the correct structure.
Keywords :
"Measurement","Maximum likelihood estimation","Couplings","Clustering algorithms","Monte Carlo methods","Clustering methods"
Publisher :
ieee
Conference_Titel :
Cyber Technology in Automation, Control, and Intelligent Systems (CYBER), 2015 IEEE International Conference on
Print_ISBN :
978-1-4799-8728-3
Type :
conf
DOI :
10.1109/CYBER.2015.7288035
Filename :
7288035
Link To Document :
بازگشت