On using a benchmark to evaluate C++ extractors

Author

Sim, Susan Elliott ; Holt, Richard C. ; Easterbrook, Steve

Author_Institution

Dept. of Comput. Sci., Toronto Univ., Ont., Canada

fYear

2002

fDate

2002

Firstpage

114

Lastpage

123

Abstract

In this paper, we take the concept of benchmarking, as used extensively in computing, and apply it to the evaluation of C++ fact extractors. We demonstrate the efficacy of this approach by developing a prototype benchmark, CppETS 1.0 (C++ Extractor Test Suite, pronounced 'see-pets') and collecting feedback in a workshop setting. The CppETS benchmark characterises C++ extractors along two dimensions: accuracy and robustness. It consists of a series of test buckets that contain small C++ programs and related questions that pose different challenges to the extractors. As with other research areas, benchmarks are best developed through technical work and consultation with a community, so we invited researchers to apply CppETS to their extractors and report on their results in a workshop. Four teams participated in this effort, evaluating the four extractors Ccia, cppx, the Rigi C++ parser and TkSee/SN. They found that CppETS gave results that were consistent with their experience with these tools and therefore had good external validity. Workshop participants agreed that CppETS was an important contribution to fact extractor development and testing. Further efforts to make CppETS a widely-accepted benchmark will involve technical improvements and collaboration with the broader community.

Keywords

C++ language; program testing; reverse engineering; software performance evaluation; C++ Extractor Test Suite; C++ fact extractor evaluation; Ccia; CppETS 1.0; Rigi C++ parser; TkSee/SN; accuracy; benchmarking; collaboration; cppx; external validity; feedback; robustness; technical improvements; test buckets; workshop setting; Benchmark testing; Collaborative work; Computer science; Data mining; Feedback; Humans; Prototypes; Reverse engineering; Robustness; Tin;

fLanguage

English

Publisher

IEEE

Conference_Titel

Program Comprehension, 2002. Proceedings. 10th International Workshop on

ISSN

1092-8138

Print_ISBN

0-7695-1495-2

Type

conf

DOI

10.1109/WPC.2002.1021331

Filename

1021331