• DocumentCode
    2824693
  • Title

    Analyzing Term Weighting Schemes for Labeling Software Clusters

  • Author

    Siddique, Faiza ; Maqbool, Onaiza

  • Author_Institution
    Dept. of Comput. Sci., Quaid-i-Azam Univ., Islamabad, Pakistan
  • fYear
    2011
  • fDate
    1-4 March 2011
  • Firstpage
    85
  • Lastpage
    88
  • Abstract
    Clustering techniques have been widely employed for software modularization. The clusters formed as a result of the clustering process may be difficult to understand unless they are appropriately labeled. One method to assign labels is to use term weighting schemes from Information Retrieval and Text Categorization which use weights to assign importance to terms in a document. Some of these term weighting schemes have been used by researchers for labeling clusters, but there is a need to compare various schemes and analyze their strengths and weaknesses. In this paper, we analyze four different schemes in the context of software and identify cases where one may be better than the other. We also conduct experiments to verify the behavior of the weighting schemes according to software characteristics.
  • Keywords
    pattern clustering; software engineering; clustering techniques; information retrieval; software cluster labelling; software modularization; term weighting scheme analysis; text categorization; Algorithm design and analysis; Clustering algorithms; Labeling; Radio frequency; Software algorithms; Software systems; Software Cluster Labeling; Software Clustering; Term Weighting Schemes;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Maintenance and Reengineering (CSMR), 2011 15th European Conference on
  • Conference_Location
    Oldenburg
  • ISSN
    1534-5351
  • Print_ISBN
    978-1-61284-259-2
  • Type

    conf

  • DOI
    10.1109/CSMR.2011.13
  • Filename
    5741249