DocumentCode
2324807
Title
Specific features of a converter of web documents from Bengali to Universal Networking Language
Author
Ali, Mohamed ; Das, Jugal Krishna ; Al-Mamun, S. M Abdullah ; Choudhury, Mihir
Author_Institution
Dept. of CSE, East West Univ., Dhaka
fYear
2008
fDate
13-15 May 2008
Firstpage
726
Lastpage
731
Abstract
In this paper, we present a workable structure along with characteristic features of a subsystem that may become an integral part of a language server bridging Bengali and the Universal Networking Language (UNL). We try to assimilate the results of the research efforts of the UNL community and also of various machine translation projects. Vast information resources in different languages are available on the Internet, but they cannot be shared, largely because of the language barrier. The UNL community is set to devise an effective and efficient system to diminish that barrier, with the ultimate aim of allowing automatic conversion of Web-based resources in one member language into another member language. A good number of researchers in computational linguistics all over the world have already joined hands with the UNL initiators, and research groups representing the most widely used natural languages are working intensively for the purpose. This paper demonstrates our pioneering efforts in the field of Bengali (Bangla). Here we outline a possible Bangla-UNL dictionary, feature an annotation editor for Bangla texts, infer significant morphological, syntactic and semantic rules for parsing Bangla web documents in connection with conversion to the UNL, and show possible ways of future contribution towards the goal.
Keywords
Internet; computational linguistics; document handling; language translation; natural language processing; Bangla Web documents; Internet; Universal Networking Language; Web based resources; automatic conversion; computational linguistics; language server; machine translation projects; natural languages; parsing; workable structure; Computational linguistics; Computer networks; Costs; Dictionaries; Information resources; Internet; Natural languages; Network servers; Scattering; Web server; Bangla-UNL Dictionary; Deconverter; Enconverter; Hyper graph; Morphological Analysis; Universal Networking Language (UNL); Universal Words (UW);
fLanguage
English
Publisher
IEEE
Conference_Titel
Computer and Communication Engineering, 2008. ICCCE 2008. International Conference on
Conference_Location
Kuala Lumpur
Print_ISBN
978-1-4244-1691-2
Electronic_ISBN
978-1-4244-1692-9
Type
conf
DOI
10.1109/ICCCE.2008.4580700
Filename
4580700