Title :
Assessing the benefits of imputing ERP projects with missing data
Author :
Myrtveit, Ingunn ; Stensrud, Erik ; Olsson, Ulf
Abstract :
Incomplete, or missing data is likely to be encountered in empirical software engineering data sets. The authors evaluate some methods for handling missing data. The methods are presented and discussed in general and thereafter applied to effort estimation of ERP projects. We found that two sampling based methods, mean imputation (MI) and similar response pattern imputation (SRPI), waste less information than listwise deletion (LD). However, MI may introduce more bias than the SRPI method. Compared to sampling based methods, likelihood based imputation methods require too large data sets to be realistic to use in empirical software engineering. None of the sampling based methods, such as MI and SRPI, seem able to correct bias. So, though imputation is an attractive idea, the available methods still have severe limitations
Keywords :
business data processing; software cost estimation; software metrics; systems re-engineering; ERP projects; SRPI method; effort estimation; empirical software engineering; empirical software engineering data sets; enterprise resource planning; large data sets; likelihood based imputation methods; listwise deletion; mean imputation; missing data; package enabled reengineering; sampling based methods; similar response pattern imputation; Costs; Data analysis; Data engineering; Engineering management; Enterprise resource planning; History; Packaging; Regression analysis; Software engineering; Waste materials;
Conference_Titel :
Software Metrics Symposium, 2001. METRICS 2001. Proceedings. Seventh International
Conference_Location :
London
Print_ISBN :
0-7695-1043-4
DOI :
10.1109/METRIC.2001.915517