• DocumentCode
    655065
  • Title

    Using Thesaurus-Based Tag Clouds to Improve Test-Driven Code Search

  • Author

    Lazzarini Lemos, Otavio Augusto ; Carvalho de Paula, Adriano ; Konishi, Gustavo ; Ossher, Joel ; Bajracharya, Sushil ; Lopes, Cristiano

  • Author_Institution
    Sci. & Technol. Dept., Fed. Univ. of Sao Paulo, Sao Jose dos Campos, Brazil
  • fYear
    2013
  • fDate
    Sept. 29 2013-Oct. 4 2013
  • Firstpage
    99
  • Lastpage
    108
  • Abstract
    Test-driven code search (TDCS) is an approach to code search and reuse that uses test cases as inputs to form the search query. Together with the test cases that provide more semantics to the search task, keywords taken from class and method names are still required. Therefore, the effectiveness of the approach also relies on how good these keywords are, i.e., how frequently they are chosen by developers to name the desired functions. To help users choose adequate words in their query test cases, visual aids can be used. In this paper we propose thesaurus-based tag clouds to show developers terms that are more frequently used in the code repository to improve their search. Terms are generated by looking up words similar to the initial keywords on a thesaurus. Tag clouds are then formed based on the frequency in which these terms appear in the code base. Our approach was implemented with an English thesaurus as an extension to CodeGenie, a Java- and Eclipse-based TDCS tool. Our evaluation shows that the approach can help improve the number of returned results. We also noticed the visual aid can be especially useful for non-native speakers of the language in which the code repository is written. These users are frequently unaware of the most common terms used to name specific functionality in the code, in the given language.
  • Keywords
    Java; natural language processing; program testing; query processing; software reusability; CodeGenie; Eclipse-based TDCS tool; English thesaurus; Java-based TDCS tool; code base; code repository; nonnative language speakers; query test cases; search query; search task; test-driven code search; thesaurus-based tag clouds; visual aids; Cloud computing; Color; Search problems; Tag clouds; Thesauri; Visualization; code search; software reuse; test-driven code search;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Components, Architectures and Reuse (SBCARS), 2013 VII Brazilian Symposium on
  • Conference_Location
    Brasilia
  • Type

    conf

  • DOI
    10.1109/SBCARS.2013.21
  • Filename
    6685795