Title :
Improving tree-based models of software quality with principal components analysis
Author :
Khoshgoftaar, Taghi M. ; Shan, Ruqun M. ; Allen, Edward B.
Author_Institution :
Florida Atlantic Univ., Boca Raton, FL, USA
Abstract :
Software quality classification models can predict which modules are to be considered fault-prone, and which are not, based on software product metrics, process metrics and execution metrics. Such predictions can be used to target improvement efforts to those modules that need them the most. Classification-tree modeling is a robust technique for building such software quality models. However, the model structure may be unstable, and accuracy may suffer when the predictors are highly correlated. This paper presents an empirical case study of four releases of a very large telecommunications system, which shows that the tree-based models can be improved by transforming the predictors with principal components analysis, so that the transformed predictors are not correlated. The case study used the regression-tree algorithm in the S-Plus package and then applied a general decision rule to classify the modules
Keywords :
pattern classification; principal component analysis; software metrics; software packages; software quality; subroutines; telecommunication computing; trees (mathematics); S-Plus package; accuracy; case study; classification-tree modeling; correlated predictors; decision rule; fault-prone modules; improvement efforts; principal components analysis; program module classification; regression-tree algorithm; software execution metrics; software process metrics; software product metrics; software quality classification models; telecommunications system; transformed predictors; tree-based models; unstable model structure; Accuracy; Buildings; Classification tree analysis; Logistics; Packaging; Predictive models; Principal component analysis; Robustness; Software metrics; Software quality;
Conference_Titel :
Software Reliability Engineering, 2000. ISSRE 2000. Proceedings. 11th International Symposium on
Conference_Location :
San Jose, CA
Print_ISBN :
0-7695-0807-3
DOI :
10.1109/ISSRE.2000.885872