Title :
Improving IR-based Traceability Recovery Using Smoothing Filters
Author :
De Lucia, Andrea ; Di Penta, Massimiliano ; Oliveto, Rocco ; Panichella, Annibale ; Panichella, Sebastiano
Author_Institution :
Software Eng. Lab., Univ. of Salerno, Fisciano, Italy
Abstract :
Information Retrieval (IR) methods have been widely adopted to identify traceability links based on the textual similarity of software artifacts. However, noise due to word usage in software artifacts can degrade recovery accuracy. We propose the use of smoothing filters to reduce the effect of this noise and improve the performance of traceability recovery methods. An empirical evaluation performed on two repositories indicates that a smoothing filter significantly improves the performance of the Vector Space Model and Latent Semantic Indexing. This result suggests that, beyond traceability recovery, the proposed filter can be used to improve the performance of various other software engineering approaches based on textual analysis.
Keywords :
filtering theory; indexing; information retrieval; program diagnostics; software engineering; IR based traceability recovery; information retrieval methods; latent semantic indexing; smoothing filters; software artifacts; software engineering approaches; vector space model; Accuracy; Large scale integration; Noise; Pixel; Smoothing methods; Software; Vocabulary; Empirical Software Engineering; Information Retrieval; Smoothing Filters; Software Traceability;
Conference_Title :
Program Comprehension (ICPC), 2011 IEEE 19th International Conference on
Conference_Location :
Kingston, ON
Print_ISBN :
978-1-61284-308-7
Electronic_ISBN :
1092-8138
DOI :
10.1109/ICPC.2011.34