DocumentCode
1851602
Title
Rule-Based and Example-Based Machine Translation from English to Arabic
Author
Alawneh, Mouiad Fadiel ; Sembok, Tengku Mohd
Author_Institution
Fac. of Inf. Sci. & T.echnol., Nat. Univ. of Malaysia, Bangi, Malaysia
fYear
2011
fDate
27-29 Sept. 2011
Firstpage
343
Lastpage
347
Abstract
Machine Translation has been defined as the process that utilizes computer software to translate text from one natural language to another. This definition involves accounting for the grammatical structure of each language and using rules, examples and grammars to transfer the grammatical structure of the source language (SL) into the target language (TL). This paper presents English to Arabic approach for translating well-structured English sentences into well-structured Arabic sentences, using a Grammar based and example-translation techniques to handle the problems of ordering and agreement. The proposed methodology is flexible and scalable, the main advantages are: first, a hybrid-based approach combined advantages of rule-based (RBMT) with advantages example-based (EBMT), and second, it can be applied on some other languages with minor modifications. The OAK Parser is used to analyze the input English text to get the part of speech (POS) for each word in the text as a pre-translation process using the C# language, validation rules have been applied in both the database design and the programming code in order to ensure the integrity of data. A major design goal of this system is that it will be used as a stand-alone tool, and can be very well integrated with a general machine translation system for English sentences.
Keywords
computer software; data integrity; database management systems; grammars; language translation; natural languages; Arabic sentences; C# language; English sentences; English to Arabic approach; OAK parser; computer software; data integrity; database design; example-based machine translation; general machine translation system; grammatical structure; hybrid-based approach; natural language; part of speech; pretranslation process; programming code; rule-based machine translation; source language; target language; validation rules; Databases; Educational institutions; Google; Grammar; Helium; Pragmatics; Syntactics; Agreement; Example-based; Hybrid-based OAK; MT; POS; Parser; Rule-Based; Word reorder;
fLanguage
English
Publisher
ieee
Conference_Titel
Bio-Inspired Computing: Theories and Applications (BIC-TA), 2011 Sixth International Conference on
Conference_Location
Penang
Print_ISBN
978-1-4577-1092-6
Type
conf
DOI
10.1109/BIC-TA.2011.76
Filename
6046925
Link To Document