DocumentCode
1267826
Title
Evaluating Term Weighting Schemes for Content-based Tag Recommendation in Social Tagging Systems
Author
Olvera, E.P. ; Godoy, D.
Author_Institution
Inst. de Inf., Univ. Tec. Estatal de Quevedo, Quevedo, Ecuador
Volume
10
Issue
4
fYear
2012
fDate
6/1/2012 12:00:00 AM
Firstpage
1973
Lastpage
1980
Abstract
Social tagging systems allow users to publish different type of resources, such as Web pages or pictures, annotate them using keywords or tags and share their resources with other users. These systems achieved widespread success on the Web on account of the simplicity for organizing resources using open-ended tags. Recently, tag recommendation strategies have been proposed to alleviate the problems of ambiguity, syntactic variations and noise in tags cause by the inherent characteristics of natural language. In this work we proposed a content-based approach that generates a list of suggested tags for annotating a given resource starting from an analysis of its textual content exclusively. Thus, the proposed method can be used in situations in which there is not enough information for creating a tag-based user profile or compare the user with others. For extracting the more relevant words different term weighting approaches were evaluated, particularly considering the HTML structure of Web pages and the grammatical category of words in order to determine promising tag candidates. Experimental results of applying this technique to tag recommendation using several term weighting approaches are reported and compared.
Keywords
Web sites; grammars; hypermedia markup languages; natural language processing; text analysis; HTML structure; Web page; content-based approach; content-based tag recommendation; grammatical category; natural language; open-ended tag; resource annotation; social tagging system; syntactic variation; tag noise; tag-based user profile; term weighting scheme; textual content; word; Abstracts; HTML; Organizing; Syntactics; Tagging; Vectors; Web pages; Social tagging systems; content-based recommendation; text mining;
fLanguage
English
Journal_Title
Latin America Transactions, IEEE (Revista IEEE America Latina)
Publisher
ieee
ISSN
1548-0992
Type
jour
DOI
10.1109/TLA.2012.6272482
Filename
6272482
Link To Document