Title :
Analysis of Activity in the Open Source Software Development Community
Author :
Christley, Scott ; Madey, Greg
Author_Institution :
Dept. of Comput. Sci. & Eng., Notre Dame Univ.
Abstract :
Open source software is computer software for which the source code is publicly open for inspection, modification, and redistribution. While research on a few large, successful projects has provided insights into the nature and practices of the open source software community, it leaves open the question of the thousands of other open source projects that are neither large nor highly successful. In this paper, we describe a data set of SourceForge.net, the world's largest open source software development site, which is available for research purposes; we discuss various data mining techniques that can be applied to the data and the types of research questions that can be answered. We apply a few of these techniques and provide an analysis of the results.
Keywords :
Web sites; data mining; public domain software; very large databases; SourceForge.net open source software development Web site; computer software; data mining techniques; Computer bugs; Data mining; Databases; History; Information analysis; Inspection; Open source software; Programming; Project management; Writing;
Conference_Title :
System Sciences, 2007. HICSS 2007. 40th Annual Hawaii International Conference on
Conference_Location :
Waikoloa, HI
Electronic_ISBN :
1530-1605
DOI :
10.1109/HICSS.2007.74