• DocumentCode
    480676
  • Title
    TagScore: Approximate Similarity Using Tag Synopses
  • Author
    Penev, Alex ; Wong, Raymond K.
  • Author_Institution
    NICTA, Univ. of New South Wales, Sydney, NSW
  • Volume
    1
  • fYear
    2008
  • fDate
    9-12 Dec. 2008
  • Firstpage
    98
  • Lastpage
    104
  • Abstract
    Collaborative tagging is the aggregate effort by a community of online users to annotate web content with metadata labels called tags. It is a simple activity that enriches our knowledge about digital content, and has gained popularity with services such as Del.icio.us. Del.icio.us has a large repository that evolves daily, presenting interesting new problems for IR. We present TagScore, a scoring function to rate the goodness of Del.icio.us tags for their associated web page. It gives us a succinct synopsis for a page that we can use to efficiently find similar pages. Using real Del.icio.us data, we show that our approach gives good correlation to cosine similarity but is several hundred times faster and requires minimal storage overhead.
  • Keywords
    Web sites; identification technology; Del.icio.us; TagScore; Web content; approximate similarity; associated web page; collaborative tagging; digital content; tag synopses; Aggregates; Australia; Filters; Frequency; Intelligent agent; International collaboration; Large-scale systems; Online Communities/Technical Collaboration; Tagging; Web pages; del.icio.us; metadata; social bookmarking; web2.0
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Web Intelligence and Intelligent Agent Technology, 2008. WI-IAT '08. IEEE/WIC/ACM International Conference on
  • Conference_Location
    Sydney, NSW
  • Print_ISBN
    978-0-7695-3496-1
  • Type
    conf
  • DOI
    10.1109/WIIAT.2008.158
  • Filename
    4740432