DocumentCode :
731534
Title :
Dataset of Developer-Labeled Commit Messages
Author :
Mauczka, Andreas ; Brosch, Florian ; Schanes, Christian ; Grechenig, Thomas
Author_Institution :
Inst. of Ind. Software, Vienna Univ. of Technol., Vienna, Austria
fYear :
2015
fDate :
16-17 May 2015
Firstpage :
490
Lastpage :
493
Abstract :
Current research on change classification centers around automated and semi-automated approaches which are based on evaluation by either the researchers themselves or external experts. In most cases, the persons evaluating the effectiveness of the classification schemes are not the authors of the original changes and therefore can only make assumptions about the intent of the changes. To support validation of existing labeling mechanisms and to provide a training set for future approaches, we present a survey of source code changes that were labeled by their original authors. Seven developers from six different projects applied three existing classification schemes from current literature to enrich their own changes with meta-information, so the intent of the changes becomes more evident. The final data set consists of 967 classified changes and is available as an SQLite database as part of the MSR data set.
Keywords :
pattern classification; SQLite database; classification scheme; developer-labeled commit messages dataset; source code changes; Data mining; Data models; Databases; Labeling; Maintenance engineering; Usability;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Mining Software Repositories (MSR), 2015 IEEE/ACM 12th Working Conference on
Conference_Location :
Florence
Type :
conf
DOI :
10.1109/MSR.2015.71
Filename :
7180125