Title :
Towards Recovering Architectural Concepts Using Latent Semantic Indexing
Author :
Van der Spek, Pieter ; Klusener, Steven ; van de Laar, P.
Author_Institution :
VU Univ. Amsterdam, Amsterdam
Abstract :
In order to address the problem of locating high-level concepts in source code we propose to use an advanced information retrieval method to exploit linguistic information found in source code, such as variable names and comments. Our technique is based on latent semantic indexing (LSI) which is also used in today´s search engines. Applying LSI to source code, however, is not straightforward. Our approach therefore not only includes LSI, but also several other algorithms and methods. We discuss the algorithms and methods that turned out to be useful and provide an overview of their effects using the results obtained from a case study at Philips Healthcare.
Keywords :
indexing; information retrieval; software architecture; architectural concepts; information retrieval; latent semantic indexing; linguistic information; Clustering algorithms; Embedded system; Indexing; Information retrieval; Large scale integration; Medical services; Natural language processing; Search engines; Software maintenance; Software systems; Clustering; Feature Extraction; Latent Semantic Indexing; Reverse Engineering; Software Architecture;
Conference_Titel :
Software Maintenance and Reengineering, 2008. CSMR 2008. 12th European Conference on
Conference_Location :
Athens
Print_ISBN :
978-1-4244-2157-2
DOI :
10.1109/CSMR.2008.4493321