DocumentCode :
700353
Title :
Modeling the evolution of development topics using Dynamic Topic Models
Author :
Jiajun Hu ; Xiaobing Sun ; Lo, David ; Bin Li
Author_Institution :
Sch. of Inf. Eng., Yangzhou Univ., Yangzhou, China
fYear :
2015
fDate :
2-6 March 2015
Firstpage :
3
Lastpage :
12
Abstract :
As the development of a software project progresses, its complexity grows accordingly, making it difficult to understand and maintain. During software maintenance and evolution, software developers and stakeholders constantly shift their focus between different tasks and topics. They need to investigate into software repositories (e.g., revision control systems) to know what tasks have recently been worked on and how much effort has been devoted to them. For example, if an important new feature request is received, an amount of work that developers perform on ought to be relevant to the addition of the incoming feature. If this does not happen, project managers might wonder what kind of work developers are currently working on. Several topic analysis tools based on Latent Dirichlet Allocation (LDA) have been proposed to analyze information stored in software repositories to model software evolution, thus helping software stakeholders to be aware of the focus of development efforts at various time during software evolution. Previous LDA-based topic analysis tools can capture either changes on the strengths of various development topics over time (i.e., strength evolution) or changes in the content of existing topics over time (i.e., content evolution). Unfortunately, none of the existing techniques can capture both strength and content evolution. In this paper, we use Dynamic Topic Models (DTM) to analyze commit messages within a project´s lifetime to capture both strength and content evolution simultaneously. We evaluate our approach by conducting a case study on commit messages of two well-known open source software systems, jEdit and PostgreSQL. The results show that our approach could capture not only how the strengths of various development topics change over time, but also how the content of each topic (i.e., words that form the topic) changes over time. Compared with existing topic analysis approaches, our approach can provide a more complete and valuable vi- w of software evolution to help developers better understand the evolution of their projects.
Keywords :
project management; software development management; software maintenance; DTM; LDA-based topic analysis tools; PostgreSQL; content evolution; development topics; dynamic topic models; evolution modeling; jEdit; latent Dirichlet allocation; open source software systems; project lifetime; project managers; revision control systems; software evolution; software maintenance; software project development; software repositories; software stakeholders; Analytical models; Computational modeling; Control systems; Data mining; Indexes; Measurement; Software;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Software Analysis, Evolution and Reengineering (SANER), 2015 IEEE 22nd International Conference on
Conference_Location :
Montreal, QC
Type :
conf
DOI :
10.1109/SANER.2015.7081810
Filename :
7081810
Link To Document :
بازگشت