Title :
Extracting clues from human interpreter speech for spoken language translation
Author :
Paulik, Matthias ; Waibel, Alex
Author_Institution :
Interactive Syst. Labs. (interACT), Carnegie Mellon Univ., Pittsburgh, PA
fDate :
March 31 2008-April 4 2008
Abstract :
In previous work, we reported dramatic improvements in automatic speech recognition (ASR) and spoken language translation (SLT) gained by applying information extracted from spoken human interpretations. These interpretations were artificially created by collecting read sentences from a clean parallel text corpus. Real human interpretations are significantly different. They suffer from frequent synopses, omissions and self-corrections. Expressing these differences in BLEU score by evaluating human interpretations with carefully created human translations, we found that human interpretations perform two to three times worse than state-of-the art SLT. Facing these stark differences, we address the question if and how ASR and SLT can profit from human interpretations. In the following we describe initial experiments that apply knowledge derived from real human interpretations for improving English and Spanish ASR and SLT. Our experiments are conducted on a small European Parliamentary Plenary Sessions development set.
Keywords :
language translation; natural language processing; speech recognition; automatic speech recognition; human interpreter speech; spoken human interpretations; spoken language translation; Art; Automatic speech recognition; Broadcasting; Contracts; Data mining; Humans; Interactive systems; Laboratories; Natural languages; Performance evaluation; STE-ASR; spoken language translation; tight coupling;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4518805