Title :
Architecture Recovery Using Latent Semantic Indexing and K-Means: An Empirical Evaluation
Author :
Risi, Michele ; Scanniello, Giuseppe ; Tortora, Genoveffa
Author_Institution :
Dipt. di Mat. e Inf., Univ. of Basilicata, Potenza, Italy
Abstract :
A number of clustering based approaches and tools have been proposed in the past to partition a software system into subsystems. The greater part of these approaches is semiautomatic, thus requiring human decision to identify the best partition of software entities into clusters among the possible partitions. In addition, some approaches are conceived for software systems implemented using a particular programming language (e.g., C and C++). In this paper we present an approach to automate the partitioning of a given software system into subsystems. In particular, the approach first analyzes the software entities (e.g., programs or classes) and then using Latent Semantic Indexing the dissimilarity between these entities is computed. Finally, software entities are grouped using iteratively the k-means clustering algorithm. The approach has been implemented in a prototype of a supporting software system as an Eclipse plug-in. Finally, to assess the approach and the plug-in, we have conducted an empirical investigation on three open source software systems implemented using the programming languages Java and C/C++.
Keywords :
C++ language; Java; indexing; pattern clustering; public domain software; software maintenance; C++; Eclipse plug-in; architecture recovery; clustering based approach; k-means clustering algorithm; latent semantic indexing; open source software systems; programming languages Java; Algorithm design and analysis; Clustering algorithms; Large scale integration; Partitioning algorithms; Software algorithms; Software systems; Clustering; Latent Semantic Indexing; Program Comprehension; Software Maintenance; Software Partitioning;
Conference_Titel :
Software Engineering and Formal Methods (SEFM), 2010 8th IEEE International Conference on
Conference_Location :
Pisa
Print_ISBN :
978-1-4244-8289-4
DOI :
10.1109/SEFM.2010.19