Title of article :
Evaluating Fidelity of Persian-English Sentence-Aligned Parallel Corpus
Author/Authors :
Mashayekhi، Masoomeh نويسنده Computer Engineering Department , , Analoui ، Morteza نويسنده Computer Engineering Department ,
Issue Information :
فصلنامه با شماره پیاپی 19 سال 2013
Abstract :
Bilingual corpus is one of the most important resources for Natural Language Processing applications and
researches. The quality of bilingual corpora can influence the result of researches that used it as a resource. When
translation machine is used to verify corpus quality, the quality of translation machine can affect the evaluation of
corpus. One way for evaluating software or resources in ISO is verifying its own features. The expectation of finding
translation for each word in each sentence by using a bilingual dictionary is verified in this paper as a factor for
evaluating fidelity of corpus. Computing this expectation needed a pre-processing step that is designed with
considering the differences between English and Persian languages. This method is a combination of a rule-based
method with the information of a dictionary.
Journal title :
International Journal of Information and Communication Technology Research
Journal title :
International Journal of Information and Communication Technology Research