• DocumentCode
    1575323
  • Title

    Impact Analysis of Missing Values on the Prediction Accuracy of Analogy-based Software Effort Estimation Method AQUA

  • Author

    Li, Jingzhou ; Al-Emran, Ahmed ; Ruhe, Guenther

  • Author_Institution
    Univ. of Calgary, Calgary
  • fYear
    2007
  • Firstpage
    126
  • Lastpage
    135
  • Abstract
    Effort estimation by analogy (EBA) is often confronted with missing values. Our former analogy-based method AQUA is able to tolerate missing values in the data set, but it is unclear how the percentage of missing values impacts the prediction accuracy and whether there is an upper bound on this percentage beyond which AQUA is no longer applicable. This paper investigates these questions through an impact analysis. The impact analysis is conducted for seven data sets of different sizes and with different initial percentages of missing values. The major results are that (i) we confirm the intuition that the more missing values, the poorer the prediction accuracy of AQUA; (ii) there is a quadratic dependency between the prediction accuracy and the percentage of missing values; and (iii) the upper limit of missing values for the applicability of AQUA is determined as 40%. These results are obtained in the context of AQUA. Further analysis is necessary for other ways of applying EBA, such as using different similarity measures or analogy adaptation methods from those used in AQUA. For that purpose, the experimental design in this study can be adapted.
  • Keywords
    software development management; AQUA; analogy-based software effort estimation method; effort estimation by analogy; impact analysis; missing values; prediction accuracy; upper bound; Accuracy; Collaboration; Design for experiments; Filtering; Information retrieval; Laboratories; Software engineering; Software measurement; Upper bound;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Empirical Software Engineering and Measurement, 2007. ESEM 2007. First International Symposium on
  • Conference_Location
    Madrid
  • ISSN
    1938-6451
  • Print_ISBN
    978-0-7695-2886-1
  • Type

    conf

  • DOI
    10.1109/ESEM.2007.10
  • Filename
    4343740