Title :
The Qualitas Corpus: A Curated Collection of Java Code for Empirical Studies
Author :
Tempero, Ewan ; Anslow, Craig ; Dietrich, Jens ; Han, Ted ; Li, Jing ; Lumpe, Markus ; Melton, Hayden ; Noble, James
Author_Institution :
Dept. of Comput. Sci., Univ. of Auckland, Auckland, New Zealand
fDate :
Nov. 30 2010-Dec. 3 2010
Abstract :
In order to increase our ability to use measurement to support software development practise we need to do more analysis of code. However, empirical studies of code are expensive and their results are difficult to compare. We describe the Qualitas Corpus, a large curated collection of open source Java systems. The corpus reduces the cost of performing large empirical studies of code and supports comparison of measurements of the same artifacts. We discuss its design, organisation, and issues associated with its development.
Keywords :
Java; codes; software engineering; Java code; Qualitas Corpus; curated collection; open source Java systems; software development; Benchmark testing; Java; Libraries; Pragmatics; Software; Software engineering; Empirical studies; curated code corpus; experimental infrastructure;
Conference_Titel :
Software Engineering Conference (APSEC), 2010 17th Asia Pacific
Conference_Location :
Sydney, NSW
Print_ISBN :
978-1-4244-8831-5
Electronic_ISBN :
1530-1362
DOI :
10.1109/APSEC.2010.46