Title : 
Clustering XML Documents Based on the Weight of Frequent Structures
         
        
            Author : 
Hwang, Jeong Hee ; Gu, Mi Sug
         
        
            Author_Institution : 
Namseoul Univ., Chonan
         
        
        
        
        
        
            Abstract : 
Previous clustering methods for XML documents group documents with similar structures by measuring the structural similarity and distance between them. In this paper, we instead propose a novel clustering method based on the weight of frequent structures in XML documents, treating each XML document as a transaction and the structures extracted from it as the items of that transaction. Our experimental results show that the method achieves high speed and high cluster cohesion.
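
The transaction view described in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes that a "structure" is a root-to-element tag path and that a structure is "frequent" when it occurs in at least `min_support` documents. The helper names `extract_paths` and `frequent_structures`, the sample documents, and the threshold are hypothetical, and the weighting and clustering steps of the paper are not reproduced here.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def extract_paths(xml_text):
    """Return the set of root-to-element tag paths found in one XML document.

    Assumption: each path plays the role of an "item" and the whole set
    plays the role of the document's "transaction".
    """
    root = ET.fromstring(xml_text)
    paths = set()

    def walk(node, prefix):
        path = prefix + "/" + node.tag
        paths.add(path)
        for child in node:
            walk(child, path)

    walk(root, "")
    return paths


def frequent_structures(transactions, min_support):
    """Map each path to its document count, keeping paths with count >= min_support."""
    counts = Counter(path for transaction in transactions for path in transaction)
    return {path: count for path, count in counts.items() if count >= min_support}


# Hypothetical sample documents, one transaction per document.
docs = [
    "<book><title/><author/></book>",
    "<book><title/><price/></book>",
    "<paper><title/><author/></paper>",
]
transactions = [extract_paths(doc) for doc in docs]
frequent = frequent_structures(transactions, min_support=2)
```

With these sample documents, only the paths shared by the two `book` documents reach the support threshold, so they are the candidates a weighting scheme could then use to group documents.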
         
        
            Keywords : 
XML; document handling; XML documents; clustering methods; structural similarity; Bioinformatics; Books; Clustering algorithms; Computer science; Databases; Information technology; Internet; Laboratories
         
        
        
        
            Conference_Title : 
Convergence Information Technology, 2007. International Conference on
         
        
            Conference_Location : 
Gyeongju
         
        
            Print_ISBN : 
0-7695-3038-9
         
        
        
            DOI : 
10.1109/ICCIT.2007.101