DocumentCode :
2869932
Title :
Enriching reverse engineering with semantic clustering
Author :
Kuhn, Adrian ; Ducasse, Stéphane ; Gîrba, Tudor
Author_Institution :
Software Composition Group, Berne Univ., Switzerland
fYear :
2005
fDate :
7-11 Nov. 2005
Abstract :
Understanding a software system by just analyzing the structure of the system reveals only half of the picture, since the structure tells us only how the code is working but not what the code is about. What the code is about can be found in the semantics of the source code: names of identifiers, comments etc. In this paper, we analyze how these terms are spread over the source artifacts using latent semantic indexing, an information retrieval technique. We use the assumption that parts of the system that use similar terms are related. We cluster artifacts that use similar terms, and we reveal the most relevant terms for the computed clusters. Our approach works at the level of the source code which makes it language independent. Nevertheless, we correlated the semantics with structural information and we applied it at different levels of abstraction (e.g. classes, methods). We applied our approach on three large case studies and we report the results we obtained.
Keywords :
formal specification; indexing; information retrieval; program diagnostics; programming language semantics; reverse engineering; structured programming; artifacts clustering; information retrieval; latent semantic indexing; reverse engineering; semantic clustering; software system; source code semantic; system structure; Computational modeling; Computer simulation; Indexing; Information analysis; Information retrieval; Large scale integration; Reverse engineering; Software systems; Vocabulary; Web server; clustering; concept location; reverse engineering; semantic analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Reverse Engineering, 12th Working Conference on
ISSN :
1095-1350
Print_ISBN :
0-7695-2474-5
Type :
conf
DOI :
10.1109/WCRE.2005.16
Filename :
1566153
Link To Document :
بازگشت