Title :
A Fast Decoder Using Less Memory
Author :
Minh, Hien Vo ; Dinh, Dien ; Hong, Nhung Nguyen Thi
Author_Institution :
Fac. of Inf. Technol., Univ. of Sci., Ho Chi Minh City, Vietnam
Abstract :
Statistical Machine Translation (SMT) uses large amount of text corpus and complex calculation operation for translation process, which makes this method require more system resources for fast translation. In this paper, we introduce an approach of decoding in SMT using less memory but translating faster, which is more suitable for mobile applications and embedded systems. In our approach, the SMT models are stored in tree structures in order to speed up the loading process and the decoding algorithm is optimized to reduce operations. We apply our approach to English-Vietnamese and Vietnamese-English SMT systems. When translating 20,000 English sentences, which are 7.45 word lengths in average, we achieve 37.8 BLEU score, the average speed is 0.052 s. In case of Vietnamese-English system, we translate 20,000 Vietnamese sentences, which are 8.42 word lengths in average, the BLEU score is 34.63 with an average speed of 0.091 s.
Keywords :
decoding; language translation; natural language processing; statistical analysis; text analysis; tree data structures; BLEU score; English sentence translation; English-Vietnamese SMT systems; Vietnamese sentence translation; Vietnamese-English SMT systems; complex calculation operation; decoding algorithm optimization; embedded systems; loading process; mobile applications; statistical machine translation; system resources; text corpus; tree structures; word lengths; Arrays; Data models; Decoding; Indexes; Load modeling; Mathematical model; Memory management; decoding algorithm; embedded system; improve decoding; mobile application; statistical machine translation;
Conference_Titel :
Knowledge and Systems Engineering (KSE), 2012 Fourth International Conference on
Conference_Location :
Danang
Print_ISBN :
978-1-4673-2171-6
DOI :
10.1109/KSE.2012.11