DocumentCode :
2797236
Title :
Mining Eclipse Developer Contributions via Author-Topic Models
Author :
Linstead, Erik ; Rigor, Paul ; Bajracharya, Sushil ; Lopes, Cristina ; Baldi, Pierre
Author_Institution :
Univ. of California at Irvine, Irvine
fYear :
2007
fDate :
20-26 May 2007
Firstpage :
30
Lastpage :
30
Abstract :
We present the results of applying statistical author-topic models to a subset of the Eclipse 3.0 source code consisting of 2,119 source files and 700,000 lines of code from 59 developers. This technique provides an intuitive and automated framework with which to mine developer contributions and competencies from a given code base while simultaneously extracting software function in the form of topics. In addition to serving as a convenient summary for program function and developer activities, our study shows that topic models provide a meaningful, effective, and statistical basis for developer similarity analysis.
Keywords :
Java; data mining; data warehouses; project management; software development management; software tools; statistical analysis; Eclipse 3.0 source code; Eclipse developer contribution mining; Java projects; open source software repositories; statistical author-topic models; Availability; Bioinformatics; Computer languages; Genomics; Information retrieval; Java; Linear discriminant analysis; Mining industry; Open source software; Text mining;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Mining Software Repositories, 2007. ICSE Workshops MSR '07. Fourth International Workshop on
Conference_Location :
Minneapolis, MN
Print_ISBN :
0-7695-2950-X
Type :
conf
DOI :
10.1109/MSR.2007.20
Filename :
4228667