Title :
Construction of a large-scale Sino-Vietnamese bilingual parallel corpus
Author :
Lin Luo ; Jian-Yi Guo ; Zheng-Tao Yu ; Yuan-yuan Mo ; Lan-Jiang Zhou
Author_Institution :
Sch. of Inf. Eng. & Autom., Kunming Univ. of Sci. & Technol., Kunming, China
Abstract :
Bilingual parallel corpora are a foundational language resource and play an increasingly important role in linguistic study and machine translation research. Based on a study of the linguistic features of Chinese-Vietnamese bilingual text, this paper describes in detail the process of constructing a large-scale Sino-Vietnamese bilingual parallel corpus, including the collection, sorting, and storage of Chinese and Vietnamese language material, followed by annotation and further processing of the bilingual corpus, so as to realize a Chinese-Vietnamese bilingual parallel corpus. Further development of this work should promote research on the relevant theories and the development and application of related technology.
Keywords :
language translation; natural language processing; text analysis; Chinese bilingual corpus annotation; Chinese bilingual corpus processing; Chinese bilingual text; Chinese language; Vietnamese language; language collection; language resources; language sorting; language storage; large-scale Sino-Vietnamese bilingual parallel corpus; machine translation; Art; Educational institutions; Presses; Software; Alignment System; Corpus building; Vietnamese Chinese; bilingual parallel corpus
Conference_Titel :
System Science and Engineering (ICSSE), 2014 IEEE International Conference on
Conference_Location :
Shanghai
DOI :
10.1109/ICSSE.2014.6887924