• DocumentCode
    3625822
  • Title

    A Fast, Feature-based Cluster Algorithm for Information Retrieval

  • Author

    Martin Mehlitz;Christian Bauckhage;Sahin Albayrak

  • Author_Institution
    Technical University Berlin, DAI-Lab, Berlin, Germany. martin.mehlitz@dai-labor.de
  • fYear
    2007
  • Firstpage
    335
  • Lastpage
    341
  • Abstract
    The Internet is a vast resource of information. Unfortunately, finding and accessing this information is often a very cumbersome task even with existing information platforms. Searching on the WWW suffers from the fact that almost every word is ambiguous to a certain degree in the information-rich environment of the Internet. Clustering search results is a way to solve this problem. This paper introduces a novel, fast way to cluster documents based on frequent term sets.
  • Keywords
    "Clustering algorithms","Information retrieval","Internet","Search engines","Laboratories","World Wide Web","Clustering methods","Matrix decomposition","Singular value decomposition","Web pages"
  • Publisher
    ieee
  • Conference_Titel
    Information Reuse and Integration, 2007. IRI 2007. IEEE International Conference on
  • Print_ISBN
    1-4244-1499-7
  • Type

    conf

  • DOI
    10.1109/IRI.2007.4296643
  • Filename
    4296643