DocumentCode :
2139775
Title :
Recovering a Balanced Overview of Topics in a Software Domain
Author :
Kelly, Matthew B. ; Alexander, Jason S. ; Adams, Bram ; Hassan, Ahmed E.
Author_Institution :
Sch. of Comput., Queen's Univ., Kingston, ON, Canada
fYear :
2011
fDate :
25-26 Sept. 2011
Firstpage :
135
Lastpage :
144
Abstract :
Domain analysis is a crucial step in the development of product lines and software reuse in general, in which domain experts try to identify the commonalities and variability between different products of a particular domain. This identification is challenging, since it requires significant manual analysis of requirements, design documents, and source code. In order to support domain analysts, this paper proposes to use topic modeling techniques to automatically identify common and unique concepts (topics) from the source code of different software products in a domain. An empirical case study of 19 projects, spread across the domains of web browsers and operating systems (totaling over 39 MLOC), shows that our approach is able to identify commonalities and variabilities at different levels of granularity (sub-domain and domain). In addition, we show how the commonalities are evenly spread across all projects of the domain.
Keywords :
data analysis; software reusability; Web browser; domain analysis; operating systems; product line development; software domain; software product; software reuse; topic balanced overview recovery; topic modeling technique; Analytical models; Browsers; Computer architecture; Operating systems; Software systems; Unified modeling language; domain analysis; empirical study; topic modeling;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Source Code Analysis and Manipulation (SCAM), 2011 11th IEEE International Working Conference on
Conference_Location :
Williamsburg, VA
Print_ISBN :
978-1-4577-0932-6
Type :
conf
DOI :
10.1109/SCAM.2011.23
Filename :
6065172