• DocumentCode
    2074303
  • Title

    Software traceability with topic modeling

  • Author

    Asuncion, Hazeline U. ; Asuncion, Arthur U. ; Taylor, Richard N.

  • Author_Institution
    Inst. for Software Res., Univ. of California, Irvine, CA, USA
  • Volume
    1
  • fYear
    2010
  • fDate
    2-8 May 2010
  • Firstpage
    95
  • Lastpage
    104
  • Abstract
    Software traceability is a fundamentally important task in software engineering. The need for automated traceability increases as projects become more complex and as the number of artifacts increases. We propose an automated technique that combines traceability with a machine learning technique known as topic modeling. Our approach automatically records traceability links during the software development process and learns a probabilistic topic model over artifacts. The learned model allows for the semantic categorization of artifacts and the topical visualization of the software system. To test our approach, we have implemented several tools: an artifact search tool combining keyword-based search and topic modeling, a recording tool that performs prospective traceability, and a visualization tool that allows one to navigate the software architecture and view semantic topics associated with relevant artifacts and architectural components. We apply our approach to several data sets and discuss how topic modeling enhances software traceability, and vice versa.
  • Keywords
    learning (artificial intelligence); probability; software engineering; automated traceability; machine learning technique; probabilistic topic model; semantic categorization; software architecture; software development process; software engineering; software traceability; topic modeling; topical visualization; Large scale integration; Machine learning; Probabilistic logic; Resource management; Semantics; Software; Visualization; latent dirichlet allocation; software architecture; software traceability; topic model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Engineering, 2010 ACM/IEEE 32nd International Conference on
  • Conference_Location
    Cape Town
  • ISSN
    0270-5257
  • Print_ISBN
    978-1-60558-719-6
  • Type

    conf

  • DOI
    10.1145/1806799.1806817
  • Filename
    6062077