DocumentCode :
3466060
Title :
CDIP: Collection-Driven, yet Individuality-Preserving Automated Blog Tagging
Author :
Kim, Jong Wook ; Candan, K. Selçuk ; Tatemura, Junichi
Author_Institution :
Arizona State Univ., Tempe
fYear :
2007
fDate :
17-19 Sept. 2007
Firstpage :
87
Lastpage :
94
Abstract :
With the success of blogs as popular information sharing media, searches on blogs have become popular. In the blogosphere, tagging is used as a means of annotating blog entries with contextually meaningful keywords, which enable users more easily locate blog content. Yet, although tags provided by bloggers are effective for organizing blog entries, in many cases, they are not always sufficient in properly capturing the semantics of the blog content. In our previous work, we observed that there exists large degree of content overlap (not only in the form of quotation/commentary pairs, but also as content borrowing across media outlets) among blog entries, which makes it hard for effective, discriminating keyword searches. In this paper, we further note that these implicit or explicit quotations could be leveraged to identify the contexts in which entries occur; thus, resulting in more effective tagging. Thus, we propose CDIP (a collection-driven, yet individuality- preserving tagging system) which relies on relationships provided by quotation/reuse detection and semantic-focus analysis to automatically tag the blogs in such a way that, not-only the related blogs share tags, but also individuality of the entries is preserved for discriminating tag-based accesses.
Keywords :
Web sites; automated blog tagging; information sharing media; quotation/reuse detection; semantic-focus analysis; Information services; Internet; Keyword search; Mirrors; National electric code; Organizing; Search engines; Tagging; Web pages; Web sites;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Semantic Computing, 2007. ICSC 2007. International Conference on
Conference_Location :
Irvine, CA
Print_ISBN :
978-0-7695-2997-4
Type :
conf
DOI :
10.1109/ICSC.2007.98
Filename :
4338336
Link To Document :
بازگشت