DocumentCode
176190
Title
Clustering Commits for Understanding the Intents of Implementation
Author
Yamauchi, Kazuto ; Jiachen Yang ; Hotta, Kazuhiro ; Higo, Y. ; Kusumoto, Shinji
Author_Institution
Grad. Sch. of Inf. Sci. & Technol., Osaka Univ., Suita, Japan
fYear
2014
fDate
Sept. 29 2014-Oct. 3 2014
Firstpage
406
Lastpage
410
Abstract
This paper proposes a novel technique for clustering commits for understanding the intents of implementation. Such a classification of commits should be able to assist developers to understand commits related to particular requirements, for example, how and why has this function been implemented, or has this function suffered from any bugs? Our technique adopts a clustering algorithm on identifier names that are related to changes in each commit. Such an approach allows us to take the semantics of each commit into account without commit messages, and so our approach is robust for the situation where some commits lack accurate descriptions. We conducted a pilot study to confirm that our idea answers to our objective. The pilot study found some good examples that showed the usefulness of our approach, and there were some undesirable results that gave some ideas to improve it.
Keywords
configuration management; software maintenance; clustering algorithm; clustering commits; intents of implementation; Clustering algorithms; Data mining; Feature extraction; Semantics; Software; Syntactics; Vectors; Commit Classification; Mining Software Repositories; Version Control System;
fLanguage
English
Publisher
ieee
Conference_Titel
Software Maintenance and Evolution (ICSME), 2014 IEEE International Conference on
Conference_Location
Victoria, BC
ISSN
1063-6773
Type
conf
DOI
10.1109/ICSME.2014.63
Filename
6976107
Link To Document