Title :
Neural Network Language Models for Translation with Limited Data
Author :
Khalilov, Maxim ; Fonollosa, José A R ; Zamora-Martinez, Francisco ; Castro-Bleda, M.J. ; Espaa-Boquera, S.
Author_Institution :
Centre de Recerca TALP, Univ. Politec. de Catalunya, Barcelona
Abstract :
In this paper we present how to estimate a continuous space language model with a neural network to be used in a statistical machine translation system. We report results for an Italian-English translation task obtained on a small corpus (about 150 K tokens), that can be considered a task with a lack of training data. Different word history length included in the connectionist language model (n-gram order) and distinct continuous space representation (i.e. words appearing in the training corpus more than k times) are considered in the study. The experimental results are evaluated by means of automatic evaluation metrics correlated with fluency and adequacy of the generated translations.
Keywords :
language translation; learning (artificial intelligence); natural language processing; neural nets; statistical analysis; Italian-English translation; continuous space connectionist language model estimation; neural network training; statistical machine translation system; Clustering algorithms; Context modeling; Encoding; History; Natural languages; Neural networks; Power system modeling; Surface-mount technology; Training data; Vocabulary; continous space language model; language models; machine translation; neural networks;
Conference_Titel :
Tools with Artificial Intelligence, 2008. ICTAI '08. 20th IEEE International Conference on
Conference_Location :
Dayton, OH
Print_ISBN :
978-0-7695-3440-4
DOI :
10.1109/ICTAI.2008.35