• DocumentCode
    635224
  • Title

    Automatic query reformulations for text retrieval in software engineering

  • Author

    Haiduc, Sonia ; Bavota, Gabriele ; Marcus, Andrian ; Oliveto, Rocco ; De Lucia, Andrea ; Menzies, T.

  • Author_Institution
    Wayne State Univ., Detroit, MI, USA
  • fYear
    2013
  • fDate
    18-26 May 2013
  • Firstpage
    842
  • Lastpage
    851
  • Abstract
    There are more than twenty distinct software engineering tasks addressed with text retrieval (TR) techniques, such as, traceability link recovery, feature location, refactoring, reuse, etc. A common issue with all TR applications is that the results of the retrieval depend largely on the quality of the query. When a query performs poorly, it has to be reformulated and this is a difficult task for someone who had trouble writing a good query in the first place. We propose a recommender (called Refoqus) based on machine learning, which is trained with a sample of queries and relevant results. Then, for a given query, it automatically recommends a reformulation strategy that should improve its performance, based on the properties of the query. We evaluated Refoqus empirically against four baseline approaches that are used in natural language document retrieval. The data used for the evaluation corresponds to changes from five open source systems in Java and C++ and it is used in the context of TR-based concept location in source code. Refoqus outperformed the baselines and its recommendations lead to query performance improvement or preservation in 84% of the cases (in average).
  • Keywords
    C++ language; Java; learning (artificial intelligence); query formulation; recommender systems; software engineering; C++; Java; Refoqus; automatic query reformulation; feature location; machine learning; recommender system; software engineering; text retrieval; traceability link recovery; Context; Engines; Frequency measurement; Natural languages; Robustness; Training; Training data; Query Reformulation; Text Retrieval;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Engineering (ICSE), 2013 35th International Conference on
  • Conference_Location
    San Francisco, CA
  • Print_ISBN
    978-1-4673-3073-2
  • Type

    conf

  • DOI
    10.1109/ICSE.2013.6606630
  • Filename
    6606630