Title :
An Improved ART 2-A Model for Mixed Numeric and Categorical Data
Author :
Han, Xiao ; Yang, Yahui ; Shen, Qingni ; Xia, Min
Author_Institution :
Sch. of Software & Microelectron., Peking Univ., Beijing, China
Abstract :
Adaptive resonance theory (ART) architectures are important neural networks for unsupervised clustering. ART 2-A is one version of the ART family capable of clustering both binary and numeric data. However, real-world problems usually contain categorical data that cannot be processed by ART 2-A. A simple solution is using binary encoding to preprocess categorical data. Binary encoding is a simple and straightforward approach, but it suffers from two main drawbacks: increase of dimensionality and lack of scalability. Therefore this paper proposes ART 2a-M, an improved version over ART 2-A. ART 2a-M can deal with mixed numeric and categorical data. Experiments were carried out on KDD Cup 99 data set to compare ART 2a-M with ART 2-A. Results show that not only ART 2a-M overcomes the two drawbacks of binary encoding, but also runs about 10% faster than the original one, while keeping the same accuracy.
Keywords :
adaptive resonance theory; encoding; neural nets; pattern clustering; unsupervised learning; ART 2-A model; adaptive resonance theory; binary encoding; categorical data; neural network; numeric data; unsupervised clustering; Computer architecture; Encoding; Microelectronics; Neural networks; Numerical models; Organizing; Resonance; Scalability; Stability; Subspace constraints;
Conference_Titel :
Information Engineering and Computer Science, 2009. ICIECS 2009. International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-4994-1
DOI :
10.1109/ICIECS.2009.5365746