Title : 
Combining Deep Learning with Information Retrieval to Localize Buggy Files for Bug Reports (N)
         
        
            Author : 
An Ngoc Lam;Anh Tuan Nguyen;Hoan Anh Nguyen;Tien N. Nguyen
         
        
            Author_Institution : 
Iowa State Univ., Ames, IA, USA
         
        
        
        
        
            Abstract : 
Bug localization refers to the automated process of locating the potential buggy files for a given bug report. To help developers focus their attention to those files is crucial. Several existing automated approaches for bug localization from a bug report face a key challenge, called lexical mismatch, in which the terms used in bug reports to describe a bug are different from the terms and code tokens used in source files. This paper presents a novel approach that uses deep neural network (DNN) in combination with rVSM, an information retrieval (IR) technique. rVSM collects the feature on the textual similarity between bug reports and source files. DNN is used to learn to relate the terms in bug reports to potentially different code tokens and terms in source files and documentation if they appear frequently enough in the pairs of reports and buggy files. Our empirical evaluation on real-world projects shows that DNN and IR complement well to each other to achieve higher bug localization accuracy than individual models. Importantly, our new model, HyLoc, with a combination of the features built from DNN, rVSM, and project´s bug-fixing history, achieves higher accuracy than the state-of-the-art IR and machine learning techniques. In half of the cases, it is correct with just a single suggested file. Two out of three cases, a correct buggy file is in the list of three suggested files.
         
        
            Keywords : 
"Feature extraction","History","Metadata","Computer bugs","Software","Bridges","Information retrieval"
         
        
        
            Conference_Titel : 
Automated Software Engineering (ASE), 2015 30th IEEE/ACM International Conference on
         
        
        
            DOI : 
10.1109/ASE.2015.73