• DocumentCode
    2654930
  • Title

    Extracting semantic annotations and their correlation with document components

  • Author

    Ahmed, Nabeel ; Khan, Sharifullah ; Latif, Khalid ; Masood, Asad

  • Author_Institution
    Sch. of Electr. Eng. & Comput. Sci., NUST, Islamabad
  • fYear
    2008
  • fDate
    18-19 Oct. 2008
  • Firstpage
    32
  • Lastpage
    37
  • Abstract
    Digital repositories can preserve terabytes of information in the form of digital documents. Searching these documents requires time and computing resources, so techniques are required to process such repositories efficiently. Metadata and semantic annotations can augment the overall search process and provide a foundation for building intelligent applications on top of the documents in the repository. In this paper, we propose an approach for generating context-aware metadata to enhance search over scientific publications, and we also demonstrate the impact of compound words on semantic metadata. The major contribution of our work is to correlate the extracted semantic annotations with the document components. This allows, for example, searching for a document centered around a scientific claim by differentiating between the author's claims and statements about related systems mentioned in different document components. The approach uses syntactic and semantic measures to increase the quality of the extracted semantic annotations and to improve the precision of search results.
  • Keywords
    document handling; information retrieval; meta data; semantic Web; ubiquitous computing; compound words; computing resources; context aware metadata; digital documents; digital repository; document components; scientific publications; semantic annotations; semantic metadata; Computer science; Context awareness; Data mining; Information retrieval; Moore's Law; Natural language processing; Proteins; Search engines; Web search; Writing; Digital Repositories; Document Components; Semantic Network
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Emerging Technologies, 2008. ICET 2008. 4th International Conference on
  • Conference_Location
    Rawalpindi
  • Print_ISBN
    978-1-4244-2210-4
  • Electronic_ISBN
    978-1-4244-2211-1
  • Type

    conf

  • DOI
    10.1109/ICET.2008.4777470
  • Filename
    4777470