DocumentCode :
2753767
Title :
Developing an Effective Validation Strategy for Genetic Programming Models Based on Multiple Datasets
Author :
Liu, Yi ; Khoshgoftaar, Taghi ; Yao, Jenq-Foung
Author_Institution :
Georgia Coll. & State Univ., Milledgeville, GA
fYear :
2006
fDate :
16-18 Sept. 2006
Firstpage :
232
Lastpage :
237
Abstract :
Genetic programming (GP) is a parallel search technique in which many solutions are obtained simultaneously during the search. However, when applied to real-world classification tasks, some of these solutions may have poor predictive performance. One reason is that such solutions merely match the shape of the training dataset and fail to learn and generalize the patterns hidden in it. As a result, unexpectedly poor results are obtained when the solutions are applied to the test dataset. This paper addresses how to remove solutions that will perform unacceptably on the test dataset. The proposed method applies a multi-dataset validation phase as a filter in GP-based classification tasks. By comparing the proposed method with a standard GP classifier on datasets from seven different NASA software projects, we demonstrate that multi-dataset validation is effective and can significantly improve the performance of GP-based software quality classification models.
Keywords :
genetic algorithms; pattern classification; program verification; software quality; NASA software project; genetic programming; model selection; multidataset validation; paired t-tests; software metrics; software quality classification; Filters; NASA; Pattern matching; Performance evaluation; Shape; Software performance; Software standards; Testing; cost misclassification; multiple datasets; validation
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Information Reuse and Integration, 2006 IEEE International Conference on
Conference_Location :
Waikoloa Village, HI
Print_ISBN :
0-7803-9788-6
Type :
conf
DOI :
10.1109/IRI.2006.252418
Filename :
4018495