• DocumentCode
    1663038
  • Title
    Automatic Annotation of Non-English Web Content
  • Author
    Ševcech, Jakub; Bieliková, M.
  • Author_Institution
    Inst. of Inf. & Software Eng., Slovak Univ. of Technol. in Bratislava, Bratislava, Slovakia
  • Volume
    3
  • fYear
    2011
  • Firstpage
    281
  • Lastpage
    284
  • Abstract
    We face information overload daily, which makes it difficult to find exactly the information we need. While reading, we often encounter a word we do not understand and need an explanation or additional information about it. On the Web, annotations are created and attached to such words for this purpose. In this paper we propose a method for automatically extending content available on the Web by adding annotations to selected terms (keywords) in the text. The method is designed to insert annotations into text written in Slovak, with the potential to be language independent. The annotations themselves are obtained through publicly available information retrieval services. We adapt the created annotations using implicit feedback from users in the form of click-through data. We evaluate the proposed method in an educational web-based system.
  • Keywords
    computer aided instruction; information retrieval; natural language processing; text analysis; Slovak text; Web content automatic extension; click-through data; educational Web-based system; implicit feedback; information overload; language independent; non-English Web content automatic annotation; Dictionaries; Encyclopedias; Internet; Shape; Web pages; Web annotation; adaptive annotations; keywords; keywords mapping
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Web Intelligence and Intelligent Agent Technology (WI-IAT), 2011 IEEE/WIC/ACM International Conference on
  • Conference_Location
    Lyon
  • Print_ISBN
    978-1-4577-1373-6
  • Electronic_ISBN
    978-0-7695-4513-4
  • Type
    conf
  • DOI
    10.1109/WI-IAT.2011.219
  • Filename
    6040860