• DocumentCode
    2544191
  • Title

    Scalable Automatic Concept Mining from Execution Traces

  • Author

    Medini, Soumaya

  • Author_Institution
    SOCCER Lab., Ecole Polytech. de Montreal, Montreal, QC, Canada
  • fYear
    2011
  • fDate
    22-24 June 2011
  • Firstpage
    238
  • Lastpage
    241
  • Abstract
    Concept identification is the task of locating and identifying concepts (e.g., domain concepts) into code region or, more generally, into artifact chunks. Concept identification is fundamental to program comprehension, software maintenance, and evolution. Different static, dynamic, and hybrid approaches for concept identification exist in the literature. Both static and dynamic techniques have advantages and limitations. In fact, they can be considered to complement each other. Indeed, recent works focused on hybrid techniques to improve the performance in time as well as accuracy (i.e., precision and recall) of the concept location process. Furthermore, sometimes only a single execution trace is available, however, to the best of our knowledge, only few works attempt to automatically identify concepts in a single execution trace. We propose an approach built upon a dynamic-programming algorithm to split an execution trace into segments likely representing concepts. The approach improves performance and scalability with respect to currently available techniques. We also plan to use techniques derived from Latent Dirichlet Allocation (LDA)to automatically assign meanings to segments.
  • Keywords
    data mining; dynamic programming; program diagnostics; software maintenance; artifact chunks; concept identification; concept location process; dynamic programming algorithm; execution traces; latent dirichlet allocation; program comprehension; scalable automatic concept mining; software evolution; software maintenance; Data mining; Heuristic algorithms; Information retrieval; Resource management; Scalability; Software maintenance; Concept identification; Dynamic analysis; Information retrieval; Latent Dirichlet Allocation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Program Comprehension (ICPC), 2011 IEEE 19th International Conference on
  • Conference_Location
    Kingston, ON
  • ISSN
    1092-8138
  • Print_ISBN
    978-1-61284-308-7
  • Electronic_ISBN
    1092-8138
  • Type

    conf

  • DOI
    10.1109/ICPC.2011.44
  • Filename
    5970171