DocumentCode :
480737
Title :
The Metadata Triumvirate: Social Annotations, Anchor Texts and Search Queries
Author :
Noll, Michael G. ; Meinel, Christoph
Author_Institution :
Hasso-Plattner-Inst. an der Univ. Potsdam, Potsdam
Volume :
1
fYear :
2008
fDate :
9-12 Dec. 2008
Firstpage :
640
Lastpage :
647
Abstract :
In this paper, we study and compare three different but related types of metadata about Web documents: social annotations provided by readers of Web documents, hyperlink anchor text provided by authors of Web documents, and search queries of users trying to find Web documents. We introduce a large research data set called CABS120k, which we have created for this study from a variety of information sources such as AOL500k, the Open Directory Project, del.icio.us/Yahoo!, Google and the WWW in general. We use this data set to investigate several characteristics of said metadata including length, novelty, diversity, and similarity and discuss theoretical and practical implications.
Keywords :
document handling; meta data; query processing; text analysis; Open Directory Project; Web documents; hyperlink anchor text; metadata triumvirate; search queries; social annotations; Data mining; Indexing; Information resources; Information retrieval; Intelligent agent; Privacy; Sampling methods; Search engines; Tagging; World Wide Web; anchor text; aol500k; cabs120k08; comparison; del.icio.us; delicious; google; metadata; open directory; pagerank; search query; social web; study; tagging; tags;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Intelligence and Intelligent Agent Technology, 2008. WI-IAT '08. IEEE/WIC/ACM International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
978-0-7695-3496-1
Type :
conf
DOI :
10.1109/WIIAT.2008.341
Filename :
4740524
Link To Document :
بازگشت