• DocumentCode
    2010065
  • Title

    On using a benchmark to evaluate C++ extractors

  • Author

    Sim, Susan Elliott ; Holt, Richard C. ; Easterbrook, Steve

  • Author_Institution
    Dept. of Comput. Sci., Toronto Univ., Ont., Canada
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    114
  • Lastpage
    123
  • Abstract
    In this paper, we take the concept of benchmarking, as used extensively in computing, and apply it to the evaluation of C++ fact extractors. We demonstrate the efficacy of this approach by developing a prototype benchmark, CppETS 1.0 (C++ Extractor Test Suite, pronounced ´see-pets´) and collecting feedback in a workshop setting. The CppETS benchmark characterises C++ extractors along two dimensions: accuracy and robustness. It consists of a series of test buckets that contain small C++ programs and related questions that pose different challenges to the extractors. As with other research areas, benchmarks are best developed through technical work and consultation with a community, so we invited researchers to apply CppETS to their extractors and report on their results in a workshop. Four teams participated in this effort, evaluating the four extractors Ccia, cppx, the Rigi C++ parser and TkSee/SN. They found that CppETS gave results that were consistent with their experience with these tools and therefore had good external validity. Workshop participants agreed that CppETS was an important contribution to fact extractor development and testing. Further efforts to make CppETS a widely-accepted benchmark will involve technical improvements and collaboration with the broader community.
  • Keywords
    C++ language; program testing; reverse engineering; software performance evaluation; C++ Extractor Test Suite; C++ fact extractor evaluation; Ccia; CppETS 1.0; Rigi C++ parser; TkSee/SN; accuracy; benchmarking; collaboration; cppx; external validity; feedback; robustness; technical improvements; test buckets; workshop setting; Benchmark testing; Collaborative work; Computer science; Data mining; Feedback; Humans; Prototypes; Reverse engineering; Robustness; Tin;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Program Comprehension, 2002. Proceedings. 10th International Workshop on
  • ISSN
    1092-8138
  • Print_ISBN
    0-7695-1495-2
  • Type

    conf

  • DOI
    10.1109/WPC.2002.1021331
  • Filename
    1021331