Title : 
Scalable construction of topic directory with nonparametric closed termset mining
         
        
            Author : 
Yu, Hwanjo ; Searsmith, Duane ; Li, Xiaolei ; Han, Jiawei
         
        
            Author_Institution : 
Dept. of Comput. Sci., Illinois Univ., Urbana, IL, USA
         
        
        
        
        
        
            Abstract : 
A topic directory, e.g., Yahoo directory, provides a view of a document set at different levels of abstraction and is ideal for the interactive exploration and visualization of the document set. We present a method that dynamically generates a topic directory from a document set using a frequent closed termset mining algorithm. Our method shows experimental results of equal quality to recent document clustering methods and has additional benefits such as automatic generation of topic labels and determination of a clustering parameter.
         
        
            Keywords : 
data mining; document handling; pattern clustering; Yahoo directory; automatic generation; document clustering; hierarchical clustering; nonparametric closed termset mining; topic directory; Clustering algorithms; Clustering methods; Computer science; Data mining; Itemsets; Organizing; Permission; Taxonomy; Tree graphs; Visualization; document clustering; hierarchical clustering; topic directory;
         
        
        
        
            Conference_Titel : 
Data Mining, 2004. ICDM '04. Fourth IEEE International Conference on
         
        
            Print_ISBN : 
0-7695-2142-8
         
        
        
            DOI : 
10.1109/ICDM.2004.10056