DocumentCode :
2654930
Title :
Extracting semantic annotations and their correlation with document components
Author :
Ahmed, Nabeel ; Khan, Sharifullah ; Latif, Khalid ; Masood, Asad
Author_Institution :
Sch. of Electr. Eng. & Comput. Sci., NUST, Islamabad
fYear :
2008
fDate :
18-19 Oct. 2008
Firstpage :
32
Lastpage :
37
Abstract :
Digital repositories can preserve terabytes of information in the form of digital documents. Searching these digital documents requires time and computing resources. Techniques are required to efficiently process these digital repositories. Metadata and semantic annotations can augment the overall search process and provide a foundation to build intelligent applications by using the documents in the repository. In this paper, we are proposing an approach for generation of context aware metadata to enhance search for the scientific publications and also prove the impact of compound words on semantic metadata. Major contribution of our work is to correlate the extracted semantic annotations with the document components. This allows, for example, searching a document centered around a scientific claim by differentiating between author´s claims and statements about related systems mentioned in different document components. The approach utilizes the syntactic and semantic measures to increase the quality of the extracted semantic annotations and to bring improvements in precision of search results.
Keywords :
document handling; information retrieval; meta data; semantic Web; ubiquitous computing; compound words; computing resources; context aware metadata; digital documents; digital repository; document components; scientific publications; semantic annotations; semantic metadata; Computer science; Context awareness; Data mining; Information retrieval; Moore´s Law; Natural language processing; Proteins; Search engines; Web search; Writing; Digital Repositries; Document Components; Semantic Network;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Emerging Technologies, 2008. ICET 2008. 4th International Conference on
Conference_Location :
Rawalpindi
Print_ISBN :
978-1-4244-2210-4
Electronic_ISBN :
978-1-4244-2211-1
Type :
conf
DOI :
10.1109/ICET.2008.4777470
Filename :
4777470
Link To Document :
بازگشت