• DocumentCode
    344320
  • Title

    Breeding policies in evolutionary approximation of optimal subspace

  • Author

    Huang, H.M. ; Leung, P.L.

  • Author_Institution
    City Univ. of Hong Kong, Kowloon, Hong Kong
  • Volume
    1
  • fYear
    1999
  • fDate
    36342
  • Firstpage
    285
  • Abstract
    In very high dimension variable space (e.g. 30 or more), huge computations evenly hinder investigators to conduct any direct meaningful analysis. A traditional trick is firstly to conduct single variable analysis, then combine several top most single-fittest variables to approximate the optimal subspace. In this investigation, an evolutionary method for optimal subspace approximation is proposed. The breeding policies of this evolutionary approximation, its scalability and generalization have been intensively investigated. The studied object is a 30-D variable space which contains 6000 artificial individuals. In this data, except for 3 variables containing two donut-type data distributions, each with 3000 individuals, the remaining 27 variables only contain quasi-random data with the same value range as the donut data distributions. The donut distribution consist of two toroidal distributions (classes) which are interlocked like links in a chain. The cross-section of each distribution is a Gaussian function distributed with standard deviation delta. Even the Donut problem which possesses a variety of pathological traits can invalidate many non-complex analyses for classification. The goal of this investigation was to find the 3 donut variables within the optimal subspace of 30-D variable space in which most quasi-random variables emerge as noise variables. In order to reach this goal, various breeding policies were implemented and compared. Although no perfect solution for the approximation was found, various breeding policies and their impact on decreasing the error were studied. These were found to be relatively usable for reference and might be improved when used in a practical application
  • Keywords
    Gaussian distribution; evolutionary computation; Gaussian function; breeding policies; donut-type data distributions; error; evolutionary approximation; noise variables; optimal subspace; quasi-random data; single variable analysis; toroidal distributions; Data mining; Data visualization; Feature extraction; Genetic algorithms; Pathology; Pattern analysis; Pattern recognition; Performance analysis; Scalability; Statistical analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Processing and Manufacturing of Materials, 1999. IPMM '99. Proceedings of the Second International Conference on
  • Conference_Location
    Honolulu, HI
  • Print_ISBN
    0-7803-5489-3
  • Type

    conf

  • DOI
    10.1109/IPMM.1999.792491
  • Filename
    792491