• DocumentCode
    3587359
  • Title

    Automatic Classification of UML Class Diagrams from Images

  • Author

    Truong Ho-Quang ; Chaudron, Michel R. V. ; Samuelsson, Ingimar ; Hjaltason, Joel ; Karasneh, Bilal ; Osman, Hafeez

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Chalmers Univ. of Technol., Gothenburg, Sweden
  • Volume
    1
  • fYear
    2014
  • Firstpage
    399
  • Lastpage
    406
  • Abstract
    Graphical modelling of various aspects of software and systems is a common part of software development. UML is the de-facto standard for various types of software models. To be able to research UML, academia needs to have a corpus of UML models. For building such a database, an automated system that has the ability to classify UML class diagram images would be very beneficial, since a large portion of UML class diagrams (UML CDs) is available as images on the Internet. In this study, we propose 23 image-features and investigate the use of these features for the purpose of classifying UML CD images. We analyse the performance of the features and assess their contribution based on their Information Gain Attribute Evaluation scores. We study specificity and sensitivity scores of six classification algorithms on a set of 1300 images. We found that 19 out of 23 introduced features can be considered as influential predictors for classifying UML CD images. Through the six algorithms, the prediction rate achieves nearly 96% correctness for UML-CD and 91% of correctness for non-UML CD.
  • Keywords
    Internet; Unified Modeling Language; image classification; software engineering; Internet; UML CD image classification; UML class diagram images; UML models; automated system; automatic classification; defacto standard; graphical modelling; information gain attribute evaluation scores; sensitivity scores; software development; software models; Classification algorithms; Data mining; Feature extraction; Shape; Training; Unified modeling language; Software Engineering; UML; UML class diagram; classification; feature extraction; machine learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Engineering Conference (APSEC), 2014 21st Asia-Pacific
  • ISSN
    1530-1362
  • Print_ISBN
    978-1-4799-7425-2
  • Type

    conf

  • DOI
    10.1109/APSEC.2014.65
  • Filename
    7091336