Title :
Software Fault Prediction Using Quad Tree-Based K-Means Clustering Algorithm
Author :
Bishnu, Partha Sarathi ; Bhattacherjee, Vandana
Author_Institution :
Dept. of Comput. Sci. & Eng., Birla Inst. of Technol., Ranchi, India
fDate :
6/1/2012 12:00:00 AM
Abstract :
Unsupervised techniques like clustering may be used for fault prediction in software modules, more so in those cases where fault labels are not available. In this paper a Quad Tree-based K-Means algorithm has been applied for predicting faults in program modules. The aims of this paper are twofold. First, Quad Trees are applied for finding the initial cluster centers to be input to the A´-Means Algorithm. An input threshold parameter δ governs the number of initial cluster centers and by varying δ the user can generate desired initial cluster centers. The concept of clustering gain has been used to determine the quality of clusters for evaluation of the Quad Tree-based initialization algorithm as compared to other initialization techniques. The clusters obtained by Quad Tree-based algorithm were found to have maximum gain values. Second, the Quad Tree- based algorithm is applied for predicting faults in program modules. The overall error rates of this prediction approach are compared to other existing algorithms and are found to be better in most of the cases.
Keywords :
pattern clustering; quadtrees; software fault tolerance; clustering gain; fault prediction; quad tree-based initialization algorithm; quad tree-based k-means clustering algorithm; software fault prediction; software modules; unsupervised techniques; Algorithm design and analysis; Clustering algorithms; Error analysis; Measurement; Prediction algorithms; Software; Software algorithms; K-Means clustering; Quad Tree; software fault prediction.;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
DOI :
10.1109/TKDE.2011.163