Title :
Using higher dimensionalities to identify abnormal behavior in noisy data sets
Author :
Olsen, David Allen
Author_Institution :
Univ. of Minnesota, Minneapolis, MN, USA
Abstract :
In cluster analysis, distinguishing abnormal behavior from noise is an important problem that has remained unresolved for many years. This paper presents a general approach for distinguishing abnormal behavior from noise and for identifying abnormal behavior in noisy data sets. The approach is an application of a new, complete linkage hierarchical clustering method for n·(n-1) over 2 + 1-level hierarchical sequences and a means for finding meaningful levels of such hierarchical sequences prior to performing a cluster analysis. These technologies were designed with small-n, large-m data sets in mind. Based on four broadly applicable assumptions, the approach uses higher dimensionalities to reveal inherent structure in noisy data sets and find meaningful levels in the corresponding hierarchical sequences. The new clustering method is used to construct only the cluster sets that correspond to these levels. Results from a first experiment show how the effects of noise are attenuated as the dimensionality of the data points increases. Results from a second experiment show how meaningful cluster sets can have real world meanings that are useful for identifying abnormal behavior.
Keywords :
pattern clustering; cluster analysis; complete linkage hierarchical clustering; noisy data sets; Clustering methods; Couplings; Noise; Noise measurement; Random variables; Sensors; Standards;
Conference_Titel :
Decision and Control (CDC), 2014 IEEE 53rd Annual Conference on
Conference_Location :
Los Angeles, CA
Print_ISBN :
978-1-4799-7746-8
DOI :
10.1109/CDC.2014.7040161