DocumentCode
710025
Title
Incorporating adaptive discretization into genetic programming for data classification
Author
Dufourq, Emmanuel ; Pillay, Nelishia
Author_Institution
Sch. of Math., Univ. of KwaZulu-Natal, Durban, South Africa
fYear
2013
fDate
15-18 Dec. 2013
Firstpage
127
Lastpage
133
Abstract
Genetic programming (GP) for data classification using decision trees has been successful in creating models which obtain high classification accuracies. When categorical data is used GP is able to directly use decision trees to create models, however when the data contains continuous attributes discretization is required as a pre-processing step prior to learning. There has been no attempt to incorporate the discretization mechanism into the GP algorithm and this serves as the rationale for this paper. This paper proposes an adaptive discretization method for inclusion into the GP algorithm by randomly creating intervals during the execution of the algorithm through the use of a new genetic operator. This proposed approach was tested on five data sets and serves as an initial attempt at dynamically altering the intervals of GP decision trees while simultaneously searching for an optimal solution during the learning phase. The proposed method performs well when compared to other non-GP adaptive methods.
Keywords
decision trees; genetic algorithms; learning (artificial intelligence); mathematical operators; pattern classification; GP algorithm; GP decision trees; adaptive discretization; data classification; genetic operator; genetic programming; learning phase; Accuracy; Adaptation models; Decision trees; Genetics; Glass; Ionosphere; Iris; classification; data mining; discretization; genetic programming; optimization;
fLanguage
English
Publisher
ieee
Conference_Titel
Information and Communication Technologies (WICT), 2013 Third World Congress on
Conference_Location
Hanoi
Type
conf
DOI
10.1109/WICT.2013.7113123
Filename
7113123
Link To Document