DocumentCode
2654930
Title
Extracting semantic annotations and their correlation with document components
Author
Ahmed, Nabeel ; Khan, Sharifullah ; Latif, Khalid ; Masood, Asad
Author_Institution
Sch. of Electr. Eng. & Comput. Sci., NUST, Islamabad
fYear
2008
fDate
18-19 Oct. 2008
Firstpage
32
Lastpage
37
Abstract
Digital repositories can preserve terabytes of information in the form of digital documents. Searching these digital documents requires time and computing resources. Techniques are required to efficiently process these digital repositories. Metadata and semantic annotations can augment the overall search process and provide a foundation to build intelligent applications by using the documents in the repository. In this paper, we are proposing an approach for generation of context aware metadata to enhance search for the scientific publications and also prove the impact of compound words on semantic metadata. Major contribution of our work is to correlate the extracted semantic annotations with the document components. This allows, for example, searching a document centered around a scientific claim by differentiating between author´s claims and statements about related systems mentioned in different document components. The approach utilizes the syntactic and semantic measures to increase the quality of the extracted semantic annotations and to bring improvements in precision of search results.
Keywords
document handling; information retrieval; meta data; semantic Web; ubiquitous computing; compound words; computing resources; context aware metadata; digital documents; digital repository; document components; scientific publications; semantic annotations; semantic metadata; Computer science; Context awareness; Data mining; Information retrieval; Moore´s Law; Natural language processing; Proteins; Search engines; Web search; Writing; Digital Repositries; Document Components; Semantic Network;
fLanguage
English
Publisher
ieee
Conference_Titel
Emerging Technologies, 2008. ICET 2008. 4th International Conference on
Conference_Location
Rawalpindi
Print_ISBN
978-1-4244-2210-4
Electronic_ISBN
978-1-4244-2211-1
Type
conf
DOI
10.1109/ICET.2008.4777470
Filename
4777470
Link To Document