• DocumentCode
    591900
  • Title

    Combining multiple translation systems for Spoken Language Understanding portability

  • Author

    Garcia, Francisco; Hurtado, L.F.; Segarra, E.; Sanchis, E.; Riccardi, Giuseppe

  • Author_Institution
    Dept. Sist. Inf. i Computacio, Univ. Politec. de Valencia, Valencia, Spain
  • fYear
    2012
  • fDate
    2-5 Dec. 2012
  • Firstpage
    194
  • Lastpage
    198
  • Abstract
    We are interested in the problem of learning Spoken Language Understanding (SLU) models for multiple target languages. Learning such models requires annotated corpora, and porting to different languages would require corpora with parallel text translation and semantic annotations. In this paper we investigate how to learn an SLU model in a target language starting from no target-language text and no semantic annotation. Our proposed algorithm exploits the diversity (with regard to performance and coverage) of multiple translation systems to transfer statistically stable word-to-concept mappings for the Romance language pair French and Spanish. Each translation system performs differently at the lexical level (as measured by BLEU). The best performance on the semantic task is obtained by combining the translation systems at different stages of the portability methodology. We have evaluated the portability algorithms on the French MEDIA corpus, using French as the source language and Spanish as the target language. The experiments show the effectiveness of the proposed methods with respect to the source language SLU baseline.
  • Keywords
    language translation; learning (artificial intelligence); natural language processing; French MEDIA corpus; French language; SLU model learning problem; Spanish language; annotated corpora; multiple translation systems; parallel text translation; portability algorithms; romance language pair; semantic annotations; spoken language understanding portability; statistically stable word-to-concept mappings; target languages; translation system performances; Automata; Media; Semantics; Speech; Speech recognition; Stochastic processes; Training; Language Portability; Spoken Language Understanding; Statistical Models;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Spoken Language Technology Workshop (SLT), 2012 IEEE
  • Conference_Location
    Miami, FL
  • Print_ISBN
    978-1-4673-5125-6
  • Electronic_ISBN
    978-1-4673-5124-9
  • Type

    conf

  • DOI
    10.1109/SLT.2012.6424221
  • Filename
    6424221