DocumentCode :
2344664
Title :
An Improved Similarity Measure for Binary Features in Software Clustering
Author :
Naseem, Rashid ; Maqbool, Onaiza ; Muhammad, Siraj
Author_Institution :
Dept. of Comput. Sci., Quaid-I-Azam Univ., Islamabad, Pakistan
fYear :
2010
fDate :
28-30 Sept. 2010
Firstpage :
111
Lastpage :
116
Abstract :
In recent years, there has been increasing interest in exploring clustering as a technique to recover the architecture of software systems. The efficacy of clustering depends not only on the clustering algorithm, but also on the choice of entities, features and similarity measures used during clustering. It is also important to understand characteristics of the domain in which clustering is being applied, since the performance of different measures and algorithms may vary depending on these characteristics. In the software domain, the Jaccard similarity measure gives better results as compared to other similarity measures for binary features. In this paper, we highlight cases where the Jaccard measure may fail to capture similarity between entities appropriately. We propose a new similarity measure which overcomes these deficiencies. Our experimental results indicate the better performance of the new similarity measure for software systems exhibiting the defined characteristics.
Keywords :
pattern clustering; software architecture; software maintenance; software metrics; Jaccard similarity measure; binary features; software clustering; software domain; software system architecture; Arbitrary decisions; Binary Features; Jaccard Measure; Jaccard- NM Measure; Software Clustering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence, Modelling and Simulation (CIMSiM), 2010 Second International Conference on
Conference_Location :
Bali
Print_ISBN :
978-1-4244-8652-6
Electronic_ISBN :
978-0-7695-4262-1
Type :
conf
DOI :
10.1109/CIMSiM.2010.34
Filename :
5701830
Link To Document :
بازگشت