DocumentCode :
3337928
Title :
Neural Network Language Models for Translation with Limited Data
Author :
Khalilov, Maxim ; Fonollosa, José A R ; Zamora-Martinez, Francisco ; Castro-Bleda, M.J. ; Espaa-Boquera, S.
Author_Institution :
Centre de Recerca TALP, Univ. Politec. de Catalunya, Barcelona
Volume :
2
fYear :
2008
fDate :
3-5 Nov. 2008
Firstpage :
445
Lastpage :
451
Abstract :
In this paper we present how to estimate a continuous space language model with a neural network to be used in a statistical machine translation system. We report results for an Italian-English translation task obtained on a small corpus (about 150 K tokens), that can be considered a task with a lack of training data. Different word history length included in the connectionist language model (n-gram order) and distinct continuous space representation (i.e. words appearing in the training corpus more than k times) are considered in the study. The experimental results are evaluated by means of automatic evaluation metrics correlated with fluency and adequacy of the generated translations.
Keywords :
language translation; learning (artificial intelligence); natural language processing; neural nets; statistical analysis; Italian-English translation; continuous space connectionist language model estimation; neural network training; statistical machine translation system; Clustering algorithms; Context modeling; Encoding; History; Natural languages; Neural networks; Power system modeling; Surface-mount technology; Training data; Vocabulary; continous space language model; language models; machine translation; neural networks;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Tools with Artificial Intelligence, 2008. ICTAI '08. 20th IEEE International Conference on
Conference_Location :
Dayton, OH
ISSN :
1082-3409
Print_ISBN :
978-0-7695-3440-4
Type :
conf
DOI :
10.1109/ICTAI.2008.35
Filename :
4669807
Link To Document :
بازگشت