Author_Institution :
Elon Univ., Elon, NC, USA
Abstract :
Software forges are centralized online systems that provide useful tools to help distributed development teams work together, especially in free, libre, and open source software (FLOSS). Forge-provided tools may include web space, version control systems, mailing lists and communication forums, bug tracking systems, file downloads, wikis, and the like. Empirical software engineering researchers can mine the artifacts from these tools to better understand how FLOSS is made. As the landscape of distributed software development has grown and changed, the tools needed to make FLOSS have changed as well. There are three newer tools at the center of FLOSS development today: distributed version control based forges (like Github), programmer question-and-answer communities (like Stack Overflow), and paste bin tools (like Gist or Pastebin.com). These tools are extending and changing the toolset used for FLOSS development, and redefining what a software forge looks like. The main contributions of this paper are to describe each of these tools, to identify the data and artifacts available for mining from these tools, and to outline some of the ways researchers can use these artifacts to continue to understand how FLOSS is made.
Keywords :
Internet; data mining; distributed processing; public domain software; software engineering; software tools; team working; FLOSS development; Forge++; Gist; Github; Pastebin.com; Stack Overflow; centralized online systems; data mining; distributed development teams; distributed software development; distributed version control based forges; free-libre-open source software; pastebin tools; programmer question-and-answer communities; software forges; Communities; Computer languages; Control systems; Data mining; Google; Licenses; Software; forges; github; open source software; pastebin; repositories; software development; stack overflow;