Title : 
A Probabilistic Based Approach towards Software System Clustering
         
        
            Author : 
Corazza, Anna ; Di Martino, Sergio ; Scanniello, Giuseppe
         
        
            Author_Institution : 
Dipt. di Sci. Fis., Sezione Inf., Univ. of Naples Federico II, Naples, Italy
         
        
        
        
        
        
            Abstract : 
In this paper we present a clustering based approach to partition software systems into meaningful subsystems. In particular, the approach uses lexical information extracted from four zones in Java classes, which may provide a different contribution towards software systems partitioning. To automatically weigh these zones, we introduced a probabilistic model, and applied the Expectation-Maximization (EM) algorithm. To group classes according to the considered lexical information, we customized the well-known K-Medoids algorithm. To assess the approach and the implemented supporting system, we have conducted a case study on six open source software systems.
         
        
            Keywords : 
Java; expectation-maximisation algorithm; inference mechanisms; pattern clustering; public domain software; software architecture; software maintenance; Java; K-Medoids algorithm; expectation maximization algorithm; lexical information extraction; open source software system; probabilistic approach; software partitioning; software system clustering; Clustering algorithms; Data mining; Partitioning algorithms; Probabilistic logic; Software algorithms; Software systems; Architecture Recovery; Clustering; Probabilistic Model; Reverse Engineering; Software Partitioning;
         
        
        
        
            Conference_Titel : 
Software Maintenance and Reengineering (CSMR), 2010 14th European Conference on
         
        
            Conference_Location : 
Madrid
         
        
        
            Print_ISBN : 
978-1-61284-369-8
         
        
            Electronic_ISBN : 
1534-5351
         
        
        
            DOI : 
10.1109/CSMR.2010.36