• DocumentCode
    395527
  • Title

    A SOM-based method for feature selection

  • Author

    Ye, Huilin ; Liu, Hanchang

  • Author_Institution
    Sch. of Electr. Eng. & Comput. Sci., Univ. of Newcastle, Callaghan, NSW, Australia
  • Volume
    3
  • fYear
    2002
  • fDate
    18-22 Nov. 2002
  • Firstpage
    1295
  • Abstract
    This paper presents a method, called feature competitive algorithm (FCA), for feature selection, which is based on an unsupervised neural network, the self-organising map (SOM). The FCA is capable of selecting the most important features describing target concepts from a given whole set of features via the unsupervised learning. The FCA is simple to implement and fast in feature selection as the learning can be done automatically and no need for training data. A quantitative measure, called average distance distortion ratio, is figured out to assess the quality of the selected feature set. An asymptotic optimal feature set can then be determined on the basis of the assessment. This addresses an open research issue in feature selection. This method has been applied to a real case, a software document collection consisting of a set of UNIX command manual pages. The results obtained from a retrieval experiment based on this collection demonstrated some very promising potential.
  • Keywords
    feature extraction; self-organising feature maps; unsupervised learning; UNIX; average distance distortion ratio; feature competitive algorithm; feature selection; self-organising map; unsupervised learning; unsupervised neural network; Australia; Computational complexity; Computer network management; Computer science; Distortion measurement; Educational institutions; Engineering management; Neural networks; Training data; Unsupervised learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Information Processing, 2002. ICONIP '02. Proceedings of the 9th International Conference on
  • Print_ISBN
    981-04-7524-1
  • Type

    conf

  • DOI
    10.1109/ICONIP.2002.1202830
  • Filename
    1202830