A Fast, Feature-based Cluster Algorithm for Information Retrieval

Author

Martin Mehlitz; Christian Bauckhage; Sahin Albayrak

Author_Institution

Technical University Berlin, DAI-Lab, Berlin, Germany. martin.mehlitz@dai-labor.de

fYear

2007

Firstpage

335

Lastpage

341

Abstract

The Internet is a vast resource of information. Unfortunately, finding and accessing this information is often cumbersome, even with existing information platforms. Web search suffers because almost every word is ambiguous to some degree in the information-rich environment of the Internet. Clustering search results is one way to address this problem. This paper introduces a novel, fast method for clustering documents based on frequent term sets.

Keywords

"Clustering algorithms","Information retrieval","Internet","Search engines","Laboratories","World Wide Web","Clustering methods","Matrix decomposition","Singular value decomposition","Web pages"

Publisher

IEEE

Conference_Title

Information Reuse and Integration, 2007. IRI 2007. IEEE International Conference on

Print_ISBN

1-4244-1499-7

Type

conf

DOI

10.1109/IRI.2007.4296643

Filename

4296643

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=3625822