• DocumentCode
    417669
  • Title

    Towards language portability in statistical speech translation

  • Author

    Waibel, Alex ; Schultz, Tanja ; Vogel, Stephan ; Fügen, Christian ; Honal, Matthias ; Kolss, Muntsin ; Reichert, Jürgen ; Stüker, Sebastian

  • Author_Institution
    Interactive Syst. Labs., Carnegie Mellon Univ., Pittsburgh, PA, USA
  • Volume
    3
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    Speech translation has made significant advances over the last years. We believe that we can overcome today´s limits of language and domain portable conversational speech translation systems by relying more radically on learning approaches and by the use of multiple layers of reduction and transformation to extract the desired content in another language. Therefore, we cascade stochastic source-channel models that extract an underlying message from a corrupt observed output. The three models effectively translate: (1) speech to word lattices (automatic speech recognition, ASR); (2) ill-formed fragments of word strings into a compact well-formed sentence (Clean); (3) sentences in one language to sentences in another (machine translation, MT). We present results of our research efforts towards rapid language portability of all these components. The results on translation suggest that MT systems can be successfully constructed for any language pair by cascading multiple MT systems via English. Moreover, end-to-end performance can be improved, if the interlingua language is enriched with additional linguistic information that can be derived automatically and monolingually in a data-driven fashion.
  • Keywords
    language translation; learning (artificial intelligence); natural languages; speech recognition; statistical analysis; stochastic processes; ASR; MT; automatic speech recognition; language portability; learning approaches; linguistic information; statistical machine translation; statistical speech translation; stochastic source-channel models; Automatic speech recognition; Cleaning; Interactive systems; Laboratories; Lattices; Natural languages; Speech enhancement; Speech recognition; Stochastic processes; Surface-mount technology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326657
  • Filename
    1326657