DocumentCode :
2851188
Title :
Data Reduction by Genetic Algorithms and Non-Algebraic Feature Construction: A Case Study
Author :
Shafti, Leila S. ; Perez, Ernesto
Author_Institution :
Univ. Autonoma de Madrid, Madrid
fYear :
2008
fDate :
10-12 Sept. 2008
Firstpage :
573
Lastpage :
578
Abstract :
Real-world data are often prepared for purposes other than data mining and machine learning and, therefore, are represented by primitive attributes. When data representation is primitive, preprocessing data before looking for patterns becomes necessary. If lack of domain experts prevents the use of highly informative attributes, patterns are hard to uncover due to complex attribute interactions. This article suggests a new use of MFE3/GA to restructure the primitive data representation by means of capturing and compacting hidden information into new features in order to highlight them to the learner. Empirical results on Poker Hand data set show that the new use successfully improves learning this concept by means of data reduction, generation of a smaller decision tree classifier, and accuracy improvement.
Keywords :
data reduction; data structures; genetic algorithms; complex attribute interactions; data mining; data reduction; data representation; decision tree classifier; genetic algorithms; machine learning; nonalgebraic feature construction; Classification tree analysis; Data mining; Data preprocessing; Decision trees; Error analysis; Genetic algorithms; Humans; Hybrid intelligent systems; Machine learning; Machine learning algorithms; Attribute Interaction; Data Reduction; Feature Construction; Genetic Algorithm; Machine Learning; Non-algebraic Representation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Hybrid Intelligent Systems, 2008. HIS '08. Eighth International Conference on
Conference_Location :
Barcelona
Print_ISBN :
978-0-7695-3326-1
Electronic_ISBN :
978-0-7695-3326-1
Type :
conf
DOI :
10.1109/HIS.2008.114
Filename :
4626691
Link To Document :
بازگشت