DocumentCode :
2753464
Title :
The Assessment and Application of Lineage Information in Genetic Programs for Producing Better Models
Author :
Boetticher, Gary D. ; Kaminsky, Kim
Author_Institution :
Houston Univ., TX
fYear :
2006
fDate :
16-18 Sept. 2006
Firstpage :
141
Lastpage :
146
Abstract :
One of the challenges in data mining, and in particular genetic programs, is to provide sufficient coverage of the search space in order to produce an acceptable model. Traditionally, genetic programs generate equations (chromosomes) and consider all chromosomes within a population for breeding purposes. Considering the enormity of the search space for complex problems, it is imperative to examine genetic programs breeding efforts in order to produce better solutions with less training. This research examines chromosome lineage within genetic programs in order to identify breeding patterns. Fitness values for chromosomes are sorted, then partitioned into five classes. Initial experiments reveal a distinct difference between upper, middle, and lower classes. Based upon initial results, a novel genetic programming process is proposed which breeds a new generation exclusively from the top 20 percent of a population. A second set of experiments statistically validate this proposed approach
Keywords :
data mining; genetic algorithms; search problems; data mining; genetic programming; lineage information; search space; Application software; Biological cells; Chromosome mapping; Data mining; Equations; Genetic programming; Lakes; Solids; Space exploration;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Reuse and Integration, 2006 IEEE International Conference on
Conference_Location :
Waikoloa Village, HI
Print_ISBN :
0-7803-9788-6
Type :
conf
DOI :
10.1109/IRI.2006.252403
Filename :
4018480
Link To Document :
بازگشت