• DocumentCode
    3335321
  • Title

    Building a Novel GP-Based Software Quality Classifier Using Multiple Validation Datasets

  • Author

    Liu, Yi ; Khoshgoftaar, Taghi ; Yao, Jenq-Foung

  • Author_Institution
    Georgia Coll. & State Univ., Milledgeville
  • fYear
    2007
  • fDate
    13-15 Aug. 2007
  • Firstpage
    644
  • Lastpage
    650
  • Abstract
    One problem associated with software quality classification (SQC) modeling is that the historical metric dataset obtained from a single software project is often not adequate to build robust and accurate models. To address this issue, recent research has used multiple datasets obtained from different software projects for SQC modeling. Our previous study demonstrated that using multiple datasets for validation can achieve robust genetic programming (GP)-based SQC models. This paper further investigates the effectiveness of using multiple validation datasets. Moreover, a novel GP-based classifier consisting of training, multiple-dataset validation, and voting phases is proposed. The experiments are carried out on seven NASA software projects. The results are compared with the results achieved by seventeen other data mining techniques. The comparisons demonstrate that the performance of our approach is significantly better when using multiple datasets from different software projects with similar reliability goals.
  • Keywords
    genetic algorithms; pattern classification; software management; software metrics; software quality; NASA software project; data mining; multiple validation dataset; multiple-dataset validation; robust genetic programming; similar reliability goal; software quality classification modeling; software quality classifier; Data mining; Educational institutions; Genetic programming; NASA; Project management; Robustness; Software metrics; Software quality; System testing; Voting; genetic programming; model selection; multiple datasets; paired t-test; software metrics; software quality classification; validation;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Information Reuse and Integration, 2007. IRI 2007. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • Print_ISBN
    1-4244-1500-4
  • Electronic_ISBN
    1-4244-1500-4
  • Type

    conf

  • DOI
    10.1109/IRI.2007.4296693
  • Filename
    4296693