• DocumentCode
    857260
  • Title

    Generalizing self-organizing map for categorical data

  • Author

    Hsu, Chung-Chian

  • Author_Institution
    Dept. of Inf. Manage., Nat. Yunlin Univ. of Sci. & Technol., Taiwan
  • Volume
    17
  • Issue
    2
  • fYear
    2006
  • fDate
    3/1/2006 12:00:00 AM
  • Firstpage
    294
  • Lastpage
    304
  • Abstract
    The self-organizing map (SOM) is an unsupervised neural network which projects high-dimensional data onto a low-dimensional grid and visually reveals the topological order of the original data. Self-organizing maps have been successfully applied to many fields, including engineering and business domains. However, the conventional SOM training algorithm handles only numeric data. Categorical data are usually converted to a set of binary data before training of an SOM takes place. If a simple transformation scheme is adopted, the similarity information embedded between categorical values may be lost. Consequently, the trained SOM is unable to reflect the correct topological order. This paper proposes a generalized self-organizing map model that offers an intuitive method of specifying the similarity between categorical values via distance hierarchies and, hence, enables the direct process of categorical values during training. In fact, distance hierarchy unifies the distance computation of both numeric and categorical values. The unification is done by mapping the values to distance hierarchies and then measuring the distance in the hierarchies. Experiments on synthetic and real datasets were conducted, and the results demonstrated the effectiveness of the generalized SOM model.
  • Keywords
    learning (artificial intelligence); self-organising feature maps; SOM training; binary data; categorical data; distance hierarchies; high-dimensional data projection; low-dimensional grid; self-organizing map; topological order; unsupervised neural network; Data analysis; Data visualization; Financial management; Helium; Information management; Management training; Marketing and sales; Neural networks; Self organizing feature maps; Transaction databases; Categorical data; cluster analysis; distance hierarchy; neural networks; self-organizing map (SOM); Algorithms; Artificial Intelligence; Computer Simulation; Models, Theoretical; Neural Networks (Computer); Numerical Analysis, Computer-Assisted; Pattern Recognition, Automated;
  • fLanguage
    English
  • Journal_Title
    Neural Networks, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9227
  • Type

    jour

  • DOI
    10.1109/TNN.2005.863415
  • Filename
    1603617