Title :
Performance Analysis of Datamining Algorithms for Software Quality Prediction
Author :
Gayatri, N. ; Nickolas, S. ; Reddy, A.V. ; Chitra, R.
Author_Institution :
Dept. of Comput. Applic., Nat. Inst. of Technol., Tiruchirappalli, India
Abstract :
Data mining techniques are applied in building software fault prediction models for improving the software quality. Early identification of high-risk modules can assist in quality enhancement efforts to modules that are likely to have a high number of faults. Classification tree models are simple and effective as software quality prediction models, and timely predictions of defects from such models can be used to achieve high software reliability. In this paper, the performance of five data mining classifier algorithms named J48, CART, Random Forest, BFTree and Naive Bayesian classifier (NBC) are evaluated based on 10 fold cross validation test. Experimental results using KC2 NASA software metrics dataset demonstrates that decision trees are much useful for fault predictions and based on rules generated only some measurement attributes in the given set of the metrics play an important role in establishing final rules and for improving the software quality by giving correct predictions. Thus we can suggest that these attributes are sufficient for future classification process. To evaluate the performance of the above algorithms Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), Receiver Operating Characteristic (ROC) and Accuracy measures are applied.
Keywords :
data mining; pattern classification; software performance evaluation; software quality; software reliability; BFTree; CART; J48; accuracy measure; classification tree model; data mining; mean absolute error; naive Bayesian classifier; performance analysis; quality enhancement; random forest; receiver operating characteristic; root mean squared error; software fault prediction model; software quality prediction; software reliability; Bayesian methods; Classification tree analysis; Data mining; Fault diagnosis; Niobium compounds; Performance analysis; Predictive models; Software algorithms; Software quality; Software reliability; BFTree; CART; Classification; CrossValidation; J48; Naive Bayesian; Random forest; Software Quality;
Conference_Titel :
Advances in Recent Technologies in Communication and Computing, 2009. ARTCom '09. International Conference on
Conference_Location :
Kottayam, Kerala
Print_ISBN :
978-1-4244-5104-3
Electronic_ISBN :
978-0-7695-3845-7
DOI :
10.1109/ARTCom.2009.12