Title :
An Online Clustering Algorithm That Ignores Outliers: Application to Hierarchical Feature Learning from Sensory Data
Author :
Banerjee, Biplab ; Dutta, Jayanta K.
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Memphis, Memphis, TN, USA
Abstract :
Surveillance sensors are a major source of unstructured Big Data. Discovering and recognizing spatiotemporal objects (e.g., events) in such data is of paramount importance to the security and safety of facilities and individuals. Hierarchical feature learning is at the crux to the problems of discovery and recognition. We present a multilayered convergent neural architecture for storing repeating spatially and temporally coincident patterns in data at multiple levels of abstraction. The bottom-up weights in each layer are learned to encode a hierarchy of over complete and sparse feature dictionaries from space- and time-varying sensory data by recursive layer-by-layer spherical clustering. This density-based clustering algorithm ignores outliers by the use of a unique adaptive threshold in each neuron´s transfer function. The model scales to full-sized high-dimensional input data and also to an arbitrary number of layers, thereby possessing the capability to capture features at any level of abstraction. It is fully-learnable with only two manually tunable parameters. The model was deployed to learn meaningful feature hierarchies from audio, images and videos which can then be used for recognition and reconstruction. Besides being online, operations in each layer of the model can be implemented in parallelized hardware, making it very efficient for real world Big Data applications.
Keywords :
data handling; learning (artificial intelligence); neural nets; pattern clustering; hierarchical feature learning; manually tunable parameters; multilayered convergent neural architecture; online clustering algorithm; recursive layer-by-layer spherical clustering; sensory data; space-varying sensory data; sparse feature dictionaries; spatially coincident patterns; surveillance sensors; temporally coincident patterns; time-varying sensory data; unstructured big data; Clustering algorithms; Computer architecture; Data models; Feedforward neural networks; Neurons; Radio frequency; Videos; Hebbian rule; outlier; spherical clustering;
Conference_Titel :
Data Mining Workshops (ICDMW), 2013 IEEE 13th International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4799-3143-9
DOI :
10.1109/ICDMW.2013.135