Title :
Automated feature extraction for supervised learning
Author :
Laird, Philip ; Saul, Ronald
Author_Institution :
NASA Ames Res. Center, Moffett Field, CA, USA
Abstract :
Feature extraction has traditionally been a manual process and something of an art. Methods derived from statistics and linear systems theory have been proposed, but by general consensus effective feature extraction remains a difficult problem. Recently W. Tackett (1993) showed that genetic programming (GP) can be effective in automatically constructing features for identifying potential targets in digital images with high accuracy. From a basis set of simple arithmetic functions, he was able to construct numerical features that outperformed manually-constructed features when used as inputs to several classifiers, including a binary-tree classifier and a multi-layer perceptron trained by back-propagation. Seeking a more generic feature-construction procedure, we developed a GP-based algorithm to extract features in a variety of domains and for most classification methods, including decision trees, feed-forward neural networks, and Bayesian classifiers. We have tested the technique with success by extracting features for three different types of problems: Boolean functions with binary features, a NASA telemetry problem with multiple classes and real-valued time-series inputs, and a wine variety classification problem with real-valued features from the UCI Machine Learning repository. We formally define the feature-construction method and show in some detail how it applies to specific classification problems
Keywords :
Boolean functions; feature extraction; feedforward neural nets; genetic algorithms; learning (artificial intelligence); pattern recognition; search problems; Bayesian classifiers; Boolean functions; GP-based algorithm; NASA telemetry problem; UCI Machine Learning repository; automated feature extraction; binary features; binary-tree classifier; classification methods; decision trees; digital images; feed-forward neural networks; generic feature-construction procedure; genetic programming; multi-layer perceptron; numerical features; real-valued features; real-valued time-series inputs; simple arithmetic functions; supervised learning; wine variety classification problem; Arithmetic; Art; Classification tree analysis; Digital images; Feature extraction; Genetic programming; Linear systems; Multilayer perceptrons; Statistics; Supervised learning;
Conference_Titel :
Evolutionary Computation, 1994. IEEE World Congress on Computational Intelligence., Proceedings of the First IEEE Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
0-7803-1899-4
DOI :
10.1109/ICEC.1994.349977