DocumentCode
635224
Title
Automatic query reformulations for text retrieval in software engineering
Author
Haiduc, Sonia ; Bavota, Gabriele ; Marcus, Andrian ; Oliveto, Rocco ; De Lucia, Andrea ; Menzies, T.
Author_Institution
Wayne State Univ., Detroit, MI, USA
fYear
2013
fDate
18-26 May 2013
Firstpage
842
Lastpage
851
Abstract
There are more than twenty distinct software engineering tasks addressed with text retrieval (TR) techniques, such as, traceability link recovery, feature location, refactoring, reuse, etc. A common issue with all TR applications is that the results of the retrieval depend largely on the quality of the query. When a query performs poorly, it has to be reformulated and this is a difficult task for someone who had trouble writing a good query in the first place. We propose a recommender (called Refoqus) based on machine learning, which is trained with a sample of queries and relevant results. Then, for a given query, it automatically recommends a reformulation strategy that should improve its performance, based on the properties of the query. We evaluated Refoqus empirically against four baseline approaches that are used in natural language document retrieval. The data used for the evaluation corresponds to changes from five open source systems in Java and C++ and it is used in the context of TR-based concept location in source code. Refoqus outperformed the baselines and its recommendations lead to query performance improvement or preservation in 84% of the cases (in average).
Keywords
C++ language; Java; learning (artificial intelligence); query formulation; recommender systems; software engineering; C++; Java; Refoqus; automatic query reformulation; feature location; machine learning; recommender system; software engineering; text retrieval; traceability link recovery; Context; Engines; Frequency measurement; Natural languages; Robustness; Training; Training data; Query Reformulation; Text Retrieval;
fLanguage
English
Publisher
ieee
Conference_Titel
Software Engineering (ICSE), 2013 35th International Conference on
Conference_Location
San Francisco, CA
Print_ISBN
978-1-4673-3073-2
Type
conf
DOI
10.1109/ICSE.2013.6606630
Filename
6606630
Link To Document