Title :
Improving the Precision of Dependence-Based Defect Mining by Supervised Learning of Rule and Violation Graphs
Author :
Sun, Boya ; Podgurski, Andy ; Ray, Soumya
Author_Institution :
EECS Dept., Case Western Reserve Univ., Cleveland, OH, USA
Abstract :
Previous work has shown that application of graph mining techniques to system dependence graphs improves the precision of automatic defect discovery by revealing subgraphs corresponding to implicit programming rules and to rule violations. However, developers must still confirm, edit, or discard reported rules and violations, which is both costly and error-prone. In order to reduce developer effort and further improve precision, we investigate the use of supervised learning models for classifying and ranking rule and violation subgraphs. In particular, we present and evaluate logistic regression models for rules and violations, respectively, which are based on general dependence-graph features. Our empirical results indicate that (i) use of these models can significantly improve the precision and recall of defect discovery, and (ii) our approach is superior to existing heuristic approaches to rule and violation ranking and to an existing static-warning classifier, and (iii) accurate models can be learned using only a few labeled examples.
Keywords :
data mining; graphs; learning (artificial intelligence); pattern classification; regression analysis; automatic defect discovery; dependence based defect mining; dependence graph feature; graph mining technique; implicit programming rule; logistic regression model; static warning classifier; supervised learning; violation graph; violation ranking; Classification algorithms; Computational modeling; Computer bugs; Data mining; Logistics; Measurement; Programming; defect classification; defect mining; dependence graph; logistic regression;
Conference_Titel :
Software Reliability Engineering (ISSRE), 2010 IEEE 21st International Symposium on
Conference_Location :
San Jose, CA
Print_ISBN :
978-1-4244-9056-1
Electronic_ISBN :
1071-9458
DOI :
10.1109/ISSRE.2010.37