• DocumentCode
    731513
  • Title

    An Enhanced Graph-Based Infrastructure for Software Search Engines

  • Author

    Schumacher, Marcus ; Atkinson, Colin

  • Author_Institution
    Dept. of Software Eng., Univ. of Mannheim, Mannheim, Germany
  • fYear
    2015
  • fDate
    16-17 May 2015
  • Firstpage
    386
  • Lastpage
    390
  • Abstract
    The first generation of software search engines such as Merobase, Sourcerer etc. Showed that it is possible to support reasonably sophisticated searches over large bodies of software components using indices based on full-text search engines (most commonly Lucene). However, the "tricks" these engines use to map code structure to flat text are not only inflexible, they do not scale well to components composed of multiple program modules (e.g. Interfaces, classes etc.) As a result, beyond plain string matching, they are only able to support a limited and a priori fixed set of query types, and are rarely, if ever able, to find components composed of more than one code module. In this paper we present an index representation approach which is able to support the key information bound up in source code in a more accurate, flexible way, and thus efficiently support a much wider range of searches on components composed of multiple program modules.
  • Keywords
    graph theory; search engines; enhanced graph-based infrastructure; full-text search engines; index representation approach; plain string matching; software search engines; Computer languages; Indexes; Search engines; Search problems; Software; Syntactics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Mining Software Repositories (MSR), 2015 IEEE/ACM 12th Working Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/MSR.2015.46
  • Filename
    7180100