DocumentCode
234770
Title
Automatic bug labeling using semantic information from LSI
Author
Chawla, Indu ; Singh, S.K.
Author_Institution
Dept. of Comput. Sci. & IT, Jaypee Inst. of Inf. Technol., Noida, India
fYear
2014
fDate
7-9 Aug. 2014
Firstpage
376
Lastpage
381
Abstract
Most open source projects provide a defect tracking system, where users, developers, testers can directly report the problems. The fields provided in the bug report help triager and debugger to understand the problem better. They also help in other tasks like accurate assessment of priority and severity of bugs, identification of appropriate developer to resolve bugs etc. Label field in the bug report is one such field. It has been observed that in many bug repositories, the label field is either not present or is incorrectly assigned. There is a need for automatic bug labeling so that bug reports could be made more informative. This paper presents an automated technique for bug labeling using TF-IDF and LSI. Experimental study shows that there is improvement in results with the addition of semantically similar words obtained from LSI in conjunction with the terms extracted using TF-IDF. Using LSI along with TF-IDF, we achieved 61.5% accuracy for the polish bug reports and 62.8% accuracy for security bug reports as compared to 53.8% accuracy for polish and 61% for security bug reports from using TF-IDF alone.
Keywords
program debugging; LSI; TF-IDF; automatic bug labeling; defect tracking system; open source projects; polish bug reports; security bug reports; semantic information; Accuracy; Browsers; Computer bugs; Labeling; Large scale integration; Security; Training; Bug categorization; Bug labeling; LSI; Latent Semantic Indexing;
fLanguage
English
Publisher
ieee
Conference_Titel
Contemporary Computing (IC3), 2014 Seventh International Conference on
Conference_Location
Noida
Print_ISBN
978-1-4799-5172-7
Type
conf
DOI
10.1109/IC3.2014.6897203
Filename
6897203
Link To Document