DocumentCode
591900
Title
Combining multiple translation systems for Spoken Language Understanding portability
Author
Garcia, Francisco ; Hurtado, L.F. ; Segarra, E. ; Sanchis, E. ; Riccardi, Giuseppe
Author_Institution
Dept. Sist. Inf. i Computacio, Univ. Politec. de Valencia, Valencia, Spain
fYear
2012
fDate
2-5 Dec. 2012
Firstpage
194
Lastpage
198
Abstract
We are interested in the problem of learning Spoken Language Understanding (SLU) models for multiple target languages. Learning such models requires annotated corpora, and porting to different languages would require corpora with parallel text translation and semantic annotations. In this paper we investigate how to learn an SLU model in a target language starting from no target text and no semantic annotation. Our proposed algorithm is based on the idea of exploiting the diversity (with regard to performance and coverage) of multiple translation systems to transfer statistically stable word-to-concept mappings in the case of the Romance language pair French and Spanish. Each translation system performs differently at the lexical level (with respect to BLEU). The best performance on the semantic task is obtained by combining the translation systems at different stages of the portability methodology. We have evaluated the portability algorithms on the French MEDIA corpus, using French as the source language and Spanish as the target language. The experiments show the effectiveness of the proposed methods with respect to the source language SLU baseline.
Keywords
language translation; learning (artificial intelligence); natural language processing; French MEDIA corpus; French language; SLU model learning problem; Spanish language; annotated corpora; multiple translation systems; parallel text translation; portability algorithms; romance language pair; semantic annotations; spoken language understanding portability; statistically stable word-to-concept mappings; target languages; translation system performances; Automata; Media; Semantics; Speech; Speech recognition; Stochastic processes; Training; Language Portability; Spoken Language Understanding; Statistical Models;
fLanguage
English
Publisher
IEEE
Conference_Titel
Spoken Language Technology Workshop (SLT), 2012 IEEE
Conference_Location
Miami, FL
Print_ISBN
978-1-4673-5125-6
Electronic_ISBN
978-1-4673-5124-9
Type
conf
DOI
10.1109/SLT.2012.6424221
Filename
6424221
Link To Document