DocumentCode
731513
Title
An Enhanced Graph-Based Infrastructure for Software Search Engines
Author
Schumacher, Marcus ; Atkinson, Colin
Author_Institution
Dept. of Software Eng., Univ. of Mannheim, Mannheim, Germany
fYear
2015
fDate
16-17 May 2015
Firstpage
386
Lastpage
390
Abstract
The first generation of software search engines such as Merobase, Sourcerer etc. Showed that it is possible to support reasonably sophisticated searches over large bodies of software components using indices based on full-text search engines (most commonly Lucene). However, the "tricks" these engines use to map code structure to flat text are not only inflexible, they do not scale well to components composed of multiple program modules (e.g. Interfaces, classes etc.) As a result, beyond plain string matching, they are only able to support a limited and a priori fixed set of query types, and are rarely, if ever able, to find components composed of more than one code module. In this paper we present an index representation approach which is able to support the key information bound up in source code in a more accurate, flexible way, and thus efficiently support a much wider range of searches on components composed of multiple program modules.
Keywords
graph theory; search engines; enhanced graph-based infrastructure; full-text search engines; index representation approach; plain string matching; software search engines; Computer languages; Indexes; Search engines; Search problems; Software; Syntactics;
fLanguage
English
Publisher
ieee
Conference_Titel
Mining Software Repositories (MSR), 2015 IEEE/ACM 12th Working Conference on
Conference_Location
Florence
Type
conf
DOI
10.1109/MSR.2015.46
Filename
7180100
Link To Document