DocumentCode
3373095
Title
Assessing uncertain predictions of software quality
Author
Khoshgoftaar, Taghi M. ; Allen, Edward B. ; Yuan, Xiaojing ; Jones, Wendell D. ; Hudepohl, John P.
Author_Institution
Florida Atlantic Univ., Boca Raton, FL, USA
fYear
1999
fDate
1999
Firstpage
159
Lastpage
168
Abstract
Many development organizations try to minimize faults in software as a means for improving customer satisfaction. Assuring high software quality often entails time-consuming and costly development processes. A software quality model based on software metrics can be used to guide enhancement efforts by predicting which modules are fault-prone. The paper presents a way to determine which predictions by a classification tree should be considered uncertain. We conducted a case study of a large legacy telecommunications system. One release was the basis for the training data set, and the subsequent release was the basis for the evaluation data set. We built a classification tree using the TREEDISC algorithm, which is based on chi-squared tests of contingency tables. The model predicted whether a module was likely to have faults discovered by customers, or not, based on software product, process, and execution metrics. We simulated practical use of the model by classifying the modules in the evaluation data set. The model achieved useful accuracy, in spite of the very small proportion of fault-prone modules in the system. We assessed whether the classes assigned to the leaves were appropriate by examining the details of the full tree, and found sizable subsets of modules with substantially uncertain classification. Discovering which modules have uncertain classifications allows sophisticated enhancement strategies to resolve uncertainties. Moreover, TREEDISC is especially well suited to identifying uncertain classifications
Keywords
software metrics; software performance evaluation; software quality; telecommunication computing; TREEDISC algorithm; chi-squared tests; classification tree; contingency tables; customer satisfaction; development processes; enhancement strategies; evaluation data set; execution metrics; fault-prone modules; large legacy telecommunications system; software metrics; software product; software quality model; uncertain classifications; uncertain predictions; Automatic testing; Computer science; Customer satisfaction; Predictive models; Software engineering; Software metrics; Software quality; System testing; Training data; Uncertainty;
fLanguage
English
Publisher
ieee
Conference_Titel
Software Metrics Symposium, 1999. Proceedings. Sixth International
Conference_Location
Boca Raton, FL
Print_ISBN
0-7695-0403-5
Type
conf
DOI
10.1109/METRIC.1999.809737
Filename
809737
Link To Document