Title :
On the use of machine translation for spoken language understanding portability
Author :
Servan, Christophe ; Camelin, Nathalie ; Raymond, Christian ; Bechet, Frederic ; de Mori, Renato
Author_Institution :
LIA-CERI, Univ. d´´Avignon et des Pays de Vaucluse, Avignon, France
Abstract :
Across language portability of a spoken language understanding system (SLU) deals with the possibility of reusing with moderate effort in a new language knowledge and data acquired for another language. The approach proposed in this paper is motivated by the availability of the fairly large MEDIA corpus carefully transcribed in French and semantically annotated in terms of constituents. A method is proposed for manually translating a portion of the training set for training an automatic machine translation (MT) system to be used for translating the remaining data. As the source language is annotated in terms of concept tags, a solution is presented for automatically transferring these tags to the translated corpus. Experimental results are presented on the accuracy of the translation expressed with the BLEU score as function of the size of the training corpus. It is shown that the process leads to comparable concept error rates in the two languages making the proposed approach suitable for SLU portability across languages.
Keywords :
language translation; natural language processing; text analysis; BLEU score; French; MEDIA training corpus; SLU portability; automatic machine translation; spoken language understanding portability; Availability; Classification tree analysis; Contracts; Error analysis; Humans; Natural languages; Size measurement; Stochastic processes; Testing; Dialog Systems; Portability across languages; Spoken Language Understanding;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5494960