DocumentCode
688462
Title
Concept localization using n-gram Information Retrieval model and Control Flow Graph
Author
Jain, Nikhil ; Garg, Radhika ; Chawla, Indu
Author_Institution
Dept. of Comput. Sci., Jaypee Inst. of Inf. Technol., Noida, India
fYear
2013
fDate
26-27 Sept. 2013
Firstpage
29
Lastpage
34
Abstract
Developing software involves many phases such as designing, coding and testing. Once the software is released, a separate team is responsible for maintaining the software. Nowadays many researchers and users work on Open Source Software to enhance its functionalities and to mould it according to their needs. Most of the time, a user or developer wants to locate a specific feature in software for the purpose of enhancement or removing a fault, which is known as concept localization. Automatic concept localization gives relevant files to the users as per the requirement. We have implemented n-gram, an Information Retrieval model to retrieve the names of the relevant files from the source code and incorporated Control Flow Graph (CFG) which helped us to determine the files encapsulating the functionality, in the correct order. We conducted tests on numerous grounds such as different threshold values (0.4, 0.6 and 0.8), N value (2 and 3) and varying query length. On examination, we obtained recall of 74% and precision of 65% on threshold value of 0.6 using trigram (i.e. n=3). Control Flow Graph significantly contributed in improving the ranking of relevant files.
Keywords
graph theory; information retrieval; public domain software; software engineering; -gram information retrieval model; CFG; automatic concept localization; control flow graph; open source software; software developmemt; CFG; Concept Localization; N-Gram; Precision; Ranking documents; Recall; Tf-Id;
fLanguage
English
Publisher
iet
Conference_Titel
Confluence 2013: The Next Generation Information Technology Summit (4th International Conference)
Conference_Location
Noida
Electronic_ISBN
978-1-84919-846-2
Type
conf
DOI
10.1049/cp.2013.2289
Filename
6832304
Link To Document