DocumentCode :
2971464
Title :
Semantic TagPrint - Tagging and Indexing Content for Semantic Search and Content Management
Author :
Kalender, Murat ; Dang, Jiangbo ; Uskudarli, Suzan
Author_Institution :
Bogazici Univ., Istanbul, Turkey
fYear :
2010
fDate :
22-24 Sept. 2010
Firstpage :
260
Lastpage :
267
Abstract :
Existing search and content management technology is facing a challenge of locating desired content with the exponentially growing volume of documents. An approach for mitigating this issue is to make use of user-generated tags. However, the improvements are limited because tags are (1) free from context and form, (2) user generated, (3) used for purposes other than description, and (4) often ambiguous. Since tagging is a voluntary action, some documents are not tagged at all. Furthermore, the interpretation of the tags associated with tagged documents also remains a challenge. To overcome these challenges, semantic web resources and technologies can be utilized to automatically generate semantic tags. Semantic tags not only reflect document content more accurately, they also enable better search results. Ontology coverage, ontology mapping and weighting significant ontological entities within a context are key challenges in semantic tagging systems. To address these challenges, this paper presents a semantic tagging system - Semantic TagPrint - to map a text document to semantic tags defined as entities in an ontology. Semantic TagPrint uses a linear time lexical chaining Word Sense Disambiguation (WSD) algorithm for real time concept mapping. In addition, it utilizes statistical metrics and ontological features of the ontology for weighting and recommending the semantic tags. A comparative evaluation shows that our mapping algorithm is fairly accurate and our tag recommendation algorithm performs better than other systems and algorithms.
Keywords :
content management; indexing; ontologies (artificial intelligence); semantic Web; statistical analysis; text analysis; content management; document content; indexing content; linear time lexical chaining word sense disambiguation; ontological entities; ontological features; ontology coverage; ontology mapping; real time concept mapping; semantic TagPrint; semantic Web resources; semantic search; semantic tagging systems; semantic tags; statistical metrics; text document; user-generated tags; Content management; Context; Internet; Knowledge based systems; Ontologies; Semantics; Tagging; Automatic tagging; Ontology; Semantic Web; Semantic tags; content management; semantic search;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Semantic Computing (ICSC), 2010 IEEE Fourth International Conference on
Conference_Location :
Pittsburgh, PA
Print_ISBN :
978-1-4244-7912-2
Electronic_ISBN :
978-0-7695-4154-9
Type :
conf
DOI :
10.1109/ICSC.2010.53
Filename :
5629261
Link To Document :
بازگشت