• DocumentCode
    2324807
  • Title

    Specific features of a converter of web documents from Bengali to Universal Networking Language

  • Author

    Ali, Mohamed ; Das, Jugal Krishna ; Al-Mamun, S. M Abdullah ; Choudhury, Mihir

  • Author_Institution
    Dept. of CSE, East West Univ., Dhaka
  • fYear
    2008
  • fDate
    13-15 May 2008
  • Firstpage
    726
  • Lastpage
    731
  • Abstract
    In this paper, we present a workable structure, along with the characteristic features, of a subsystem that may become an integral part of a language server bridging Bengali and the Universal Networking Language (UNL). We try to assimilate the results of the research efforts of the UNL community and of various machine translation projects. Vast information resources in different languages are available on the Internet, but they cannot be shared because of the language barrier. The UNL community is set to devise an effective and efficient system to diminish that barrier, with the ultimate aim of allowing automatic conversion of Web-based resources in one member language into another member language. A good number of researchers in computational linguistics all over the world have already joined hands with the UNL initiators, and research groups representing the most widely used natural languages are working intensively toward this purpose. This paper demonstrates our pioneering efforts in the field for Bengali (Bangla). We outline a possible Bangla-UNL dictionary, feature an annotation editor for Bangla texts, infer significant morphological, syntactic and semantic rules for parsing Bangla web documents in connection with conversion to the UNL, and show possible ways of future contribution towards the goal.
  • Keywords
    Internet; computational linguistics; document handling; language translation; natural language processing; Bangla Web documents; Universal Networking Language; Web based resources; automatic conversion; language server; machine translation projects; natural languages; parsing; workable structure; Computer networks; Costs; Dictionaries; Information resources; Network servers; Scattering; Web server; Bangla-UNL Dictionary; Deconverter; Enconverter; Hyper graph; Morphological Analysis; Universal Networking Language (UNL); Universal Words (UW)
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer and Communication Engineering, 2008. ICCCE 2008. International Conference on
  • Conference_Location
    Kuala Lumpur
  • Print_ISBN
    978-1-4244-1691-2
  • Electronic_ISBN
    978-1-4244-1692-9
  • Type

    conf

  • DOI
    10.1109/ICCCE.2008.4580700
  • Filename
    4580700