Title :
Data Reduction by Genetic Algorithms and Non-Algebraic Feature Construction: A Case Study
Author :
Shafti, Leila S. ; Perez, Ernesto
Author_Institution :
Univ. Autonoma de Madrid, Madrid
Abstract :
Real-world data are often prepared for purposes other than data mining and machine learning and, therefore, are represented by primitive attributes. When data representation is primitive, preprocessing data before looking for patterns becomes necessary. If lack of domain experts prevents the use of highly informative attributes, patterns are hard to uncover due to complex attribute interactions. This article suggests a new use of MFE3/GA to restructure the primitive data representation by means of capturing and compacting hidden information into new features in order to highlight them to the learner. Empirical results on Poker Hand data set show that the new use successfully improves learning this concept by means of data reduction, generation of a smaller decision tree classifier, and accuracy improvement.
Keywords :
data reduction; data structures; genetic algorithms; complex attribute interactions; data mining; data reduction; data representation; decision tree classifier; genetic algorithms; machine learning; nonalgebraic feature construction; Classification tree analysis; Data mining; Data preprocessing; Decision trees; Error analysis; Genetic algorithms; Humans; Hybrid intelligent systems; Machine learning; Machine learning algorithms; Attribute Interaction; Data Reduction; Feature Construction; Genetic Algorithm; Machine Learning; Non-algebraic Representation;
Conference_Titel :
Hybrid Intelligent Systems, 2008. HIS '08. Eighth International Conference on
Conference_Location :
Barcelona
Print_ISBN :
978-0-7695-3326-1
Electronic_ISBN :
978-0-7695-3326-1
DOI :
10.1109/HIS.2008.114