Title :
Identifying Effective Test Cases through K-Means Clustering for Enhancing Regression Testing
Author :
Yulei Pang ; Xiaozhen Xue ; Namin, Akbar Siami
Author_Institution :
Dept. of Math. & Stat., Texas Tech Univ., Lubbock, TX, USA
Abstract :
Testing is the most time consuming and expensive process in the software development life cycle. In order to reduce the cost of regression testing, we propose a test case classification methodology based on k-means clustering with the purpose of classifying test cases into two groups of effective and non-effective test cases. The clustering strategy is based on Hamming distances measured over the differences between coverage information obtained for current and the previous releases of the program under test. Our empirical study shows that the clustering-based test case classification can identify effective test cases with high recall ratio and considerable accuracy percentage. The paper also investigates and compares the performance of the proposed clustering-based approach with various factors including coverage criteria and the weights factor used in measuring distances.
Keywords :
pattern classification; product life cycle management; program testing; regression analysis; statistical testing; Hamming distances; clustering-based test case classification methodology; effective test case identification; k-means clustering strategy; regression testing cost reduction; software development life cycle; software testing; Accuracy; Classification algorithms; Clustering algorithms; Hamming distance; Measurement; Software; Testing; k-means clustering; regression testing; test case classification;
Conference_Titel :
Machine Learning and Applications (ICMLA), 2013 12th International Conference on
Conference_Location :
Miami, FL
DOI :
10.1109/ICMLA.2013.109