• DocumentCode
    1267826
  • Title

    Evaluating Term Weighting Schemes for Content-based Tag Recommendation in Social Tagging Systems

  • Author

    Olvera, E.P. ; Godoy, D.

  • Author_Institution
    Inst. de Inf., Univ. Tec. Estatal de Quevedo, Quevedo, Ecuador
  • Volume
    10
  • Issue
    4
  • fYear
    2012
  • fDate
    6/1/2012 12:00:00 AM
  • Firstpage
    1973
  • Lastpage
    1980
  • Abstract
    Social tagging systems allow users to publish different type of resources, such as Web pages or pictures, annotate them using keywords or tags and share their resources with other users. These systems achieved widespread success on the Web on account of the simplicity for organizing resources using open-ended tags. Recently, tag recommendation strategies have been proposed to alleviate the problems of ambiguity, syntactic variations and noise in tags cause by the inherent characteristics of natural language. In this work we proposed a content-based approach that generates a list of suggested tags for annotating a given resource starting from an analysis of its textual content exclusively. Thus, the proposed method can be used in situations in which there is not enough information for creating a tag-based user profile or compare the user with others. For extracting the more relevant words different term weighting approaches were evaluated, particularly considering the HTML structure of Web pages and the grammatical category of words in order to determine promising tag candidates. Experimental results of applying this technique to tag recommendation using several term weighting approaches are reported and compared.
  • Keywords
    Web sites; grammars; hypermedia markup languages; natural language processing; text analysis; HTML structure; Web page; content-based approach; content-based tag recommendation; grammatical category; natural language; open-ended tag; resource annotation; social tagging system; syntactic variation; tag noise; tag-based user profile; term weighting scheme; textual content; word; Abstracts; HTML; Organizing; Syntactics; Tagging; Vectors; Web pages; Social tagging systems; content-based recommendation; text mining;
  • fLanguage
    English
  • Journal_Title
    Latin America Transactions, IEEE (Revista IEEE America Latina)
  • Publisher
    ieee
  • ISSN
    1548-0992
  • Type

    jour

  • DOI
    10.1109/TLA.2012.6272482
  • Filename
    6272482