Assessing uncertain predictions of software quality

Author

Khoshgoftaar, Taghi M. ; Allen, Edward B. ; Yuan, Xiaojing ; Jones, Wendell D. ; Hudepohl, John P.

Author_Institution

Florida Atlantic Univ., Boca Raton, FL, USA

fYear

1999

fDate

1999

Firstpage

159

Lastpage

168

Abstract

Many development organizations try to minimize faults in software as a means for improving customer satisfaction. Assuring high software quality often entails time-consuming and costly development processes. A software quality model based on software metrics can be used to guide enhancement efforts by predicting which modules are fault-prone. The paper presents a way to determine which predictions by a classification tree should be considered uncertain. We conducted a case study of a large legacy telecommunications system. One release was the basis for the training data set, and the subsequent release was the basis for the evaluation data set. We built a classification tree using the TREEDISC algorithm, which is based on chi-squared tests of contingency tables. The model predicted whether a module was likely to have faults discovered by customers, or not, based on software product, process, and execution metrics. We simulated practical use of the model by classifying the modules in the evaluation data set. The model achieved useful accuracy, in spite of the very small proportion of fault-prone modules in the system. We assessed whether the classes assigned to the leaves were appropriate by examining the details of the full tree, and found sizable subsets of modules with substantially uncertain classification. Discovering which modules have uncertain classifications allows sophisticated enhancement strategies to resolve uncertainties. Moreover, TREEDISC is especially well suited to identifying uncertain classifications

Keywords

software metrics; software performance evaluation; software quality; telecommunication computing; TREEDISC algorithm; chi-squared tests; classification tree; contingency tables; customer satisfaction; development processes; enhancement strategies; evaluation data set; execution metrics; fault-prone modules; large legacy telecommunications system; software metrics; software product; software quality model; uncertain classifications; uncertain predictions; Automatic testing; Computer science; Customer satisfaction; Predictive models; Software engineering; Software metrics; Software quality; System testing; Training data; Uncertainty;

fLanguage

English

Publisher

ieee

Conference_Titel

Software Metrics Symposium, 1999. Proceedings. Sixth International

Conference_Location

Boca Raton, FL

Print_ISBN

0-7695-0403-5

Type

conf

DOI

10.1109/METRIC.1999.809737

Filename

809737