Title :
Knowledge source construction in data-oriented English-Chinese machine translation
Author :
Zhang, Yuejie ; Zhang, Tao
Author_Institution :
Dept. of Comput. Sci. & Eng., Fudan Univ., Shanghai, China
Date :
30 Oct.-1 Nov. 2005
Abstract :
In data-oriented English-Chinese machine translation, the knowledge source is the essential basis for translation processing. This paper presents a strategy for constructing a knowledge source that contains rich grammatical and syntactic information. First, taking Lexical Functional Grammar as the theoretical basis, a treebank is acquired by converting every sentence in the source-language corpus into a parse tree. Second, using a decomposition algorithm, a corresponding fragment bank is constructed from all the legal fragments extracted from the treebank. Finally, using a combination algorithm, a fragment-combination bank is built that includes all possible fragment-combination forms of every parse tree in the treebank. Once the knowledge source has been constructed successfully, the whole machine translation process can be carried out efficiently and accurately.
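The abstract does not spell out the decomposition and combination algorithms, so the following is only a minimal sketch of what the two steps could look like under the usual Data-Oriented Parsing convention, which is an assumption here: a fragment keeps all children of every node it contains, and any non-word child may be cut and left as a bare category. Plain phrase-structure trees stand in for the Lexical Functional Grammar representations, and the names `rooted_fragments`, `all_fragments`, and `derivations` are hypothetical, not taken from the paper.

```python
from itertools import product

# Assumed encoding: a tree is (label, child, child, ...);
# a child that is a plain string is a word, and (label,) is a cut node.


def rooted_fragments(tree):
    """Return every fragment whose root is the root of `tree`."""
    label, *children = tree
    options = []
    for child in children:
        if isinstance(child, str):
            options.append([child])  # words are always kept
        else:
            # Either cut the child (bare category) or expand it further.
            options.append([(child[0],)] + rooted_fragments(child))
    return [(label, *combo) for combo in product(*options)]


def all_fragments(tree):
    """Decomposition step: fragments rooted at every node of `tree`."""
    frags = rooted_fragments(tree)
    for child in tree[1:]:
        if not isinstance(child, str):
            frags.extend(all_fragments(child))
    return frags


def derivations(tree):
    """Combination step: every fragment sequence that rebuilds `tree`."""
    label, *children = tree
    per_child = []
    for child in children:
        if isinstance(child, str):
            per_child.append([(child, [])])
            continue
        choices = []
        for d in derivations(child):
            # Cut: the child becomes a bare category and its own
            # derivation is appended as separate fragments.
            choices.append(((child[0],), d))
            # No cut: the child's first fragment is merged into ours.
            choices.append((d[0], d[1:]))
        per_child.append(choices)
    result = []
    for combo in product(*per_child):
        root_fragment = (label, *(piece for piece, _ in combo))
        remaining = [f for _, rest in combo for f in rest]
        result.append([root_fragment] + remaining)
    return result


tree = ("S", ("NP", "she"), ("VP", ("V", "sleeps")))
fragment_bank = all_fragments(tree)
combination_bank = derivations(tree)
print(len(fragment_bank))     # 10
print(len(combination_bank))  # 8
```

In this sketch the fragment bank for one tree grows quickly with the number of nodes, and the number of fragment combinations doubles with every non-root category node, which suggests why building both banks ahead of time, rather than during translation, could matter for efficiency.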
Keywords :
computational linguistics; grammars; language translation; natural languages; data-oriented English-Chinese machine translation; knowledge source construction; legal fragment extraction; parse trees; treebank; Computer science; Data engineering; Data mining; Finance; Humans; Knowledge engineering; Laboratories; Law; Legal factors; Tagging
Conference_Title :
Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
Print_ISBN :
0-7803-9361-9
DOI :
10.1109/NLPKE.2005.1598771