DocumentCode
3625822
Title
A Fast, Feature-based Cluster Algorithm for Information Retrieval
Author
Martin Mehlitz;Christian Bauckhage;Sahin Albayrak
Author_Institution
Technical University Berlin, DAI-Lab, Berlin, Germany. martin.mehlitz@dai-labor.de
fYear
2007
Firstpage
335
Lastpage
341
Abstract
The Internet is a vast resource of information. Unfortunately, finding and accessing this information is often a very cumbersome task even with existing information platforms. Searching on the WWW suffers from the fact that almost every word is ambiguous to a certain degree in the information-rich environment of the Internet. Clustering search results is a way to solve this problem. This paper introduces a novel, fast way to cluster documents based on frequent term sets.
Keywords
"Clustering algorithms","Information retrieval","Internet","Search engines","Laboratories","World Wide Web","Clustering methods","Matrix decomposition","Singular value decomposition","Web pages"
Publisher
ieee
Conference_Titel
Information Reuse and Integration, 2007. IRI 2007. IEEE International Conference on
Print_ISBN
1-4244-1499-7
Type
conf
DOI
10.1109/IRI.2007.4296643
Filename
4296643
Link To Document