• DocumentCode
    644075
  • Title

    A Replicable Infrastructure for Empirical Studies of Email Archives

  • Author

    Squire, Megan

  • Author_Institution
    Dept. of Comput. Sci., Elon Univ., Elon, NC, USA
  • fYear
    2013
  • fDate
    9-9 Oct. 2013
  • Firstpage
    43
  • Lastpage
    49
  • Abstract
    This paper describes a replicable infrastructure solution for conducting empirical software engineering studies based on email mailing list archives. Mailing list emails, such as those affiliated with free, libre, and open source software (FLOSS) projects, are currently archived in several places online, but each research team that wishes to study these email artifacts closely must design their own solution for collection, storage and cleaning of the data. Consequently, research results will be difficult to replicate, especially as the email archive for any living project will still be continually growing. This paper describes a simple, replicable infrastructure for the collection, storage, and cleaning of project email data and analyses.
  • Keywords
    electronic mail; public domain software; software engineering; FLOSS projects; email archives; email artifacts; email mailing list archives; empirical software engineering studies; free, libre, and open source software projects; mailing list emails; project email data cleaning; project email data collection; project email data storage; replicable infrastructure solution; Cleaning; Databases; Electronic mail; Google; HTML; Servers; Software; archive; cleaning; collection; database; document-oriented database; email; mailing list; storage;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Replication in Empirical Software Engineering Research (RESER), 2013 3rd International Workshop on
  • Conference_Location
    Baltimore, MD
  • Type

    conf

  • DOI
    10.1109/RESER.2013.11
  • Filename
    6664730