Title : 
A Fast, Feature-based Cluster Algorithm for Information Retrieval
         
        
            Author : 
Martin Mehlitz;Christian Bauckhage;Sahin Albayrak
         
        
            Author_Institution : 
Technical University Berlin, DAI-Lab, Berlin, Germany. martin.mehlitz@dai-labor.de
         
        
        
        
        
            Abstract : 
The Internet is a vast resource of information. Unfortunately, finding and accessing this information is often a very cumbersome task even with existing information platforms. Searching on the WWW suffers from the fact that almost every word is ambiguous to a certain degree in the information-rich environment of the Internet. Clustering search results is a way to solve this problem. This paper introduces a novel, fast way to cluster documents based on frequent term sets.
         
        
            Keywords : 
"Clustering algorithms","Information retrieval","Internet","Search engines","Laboratories","World Wide Web","Clustering methods","Matrix decomposition","Singular value decomposition","Web pages"
         
        
        
            Conference_Titel : 
Information Reuse and Integration, 2007. IRI 2007. IEEE International Conference on
         
        
            Print_ISBN : 
1-4244-1499-7
         
        
        
            DOI : 
10.1109/IRI.2007.4296643