DocumentCode :
1695957
Title :
Adaptation of lecture speech recognition system with machine translation output
Author :
Ng, Raymond W. M. ; Hain, Thomas ; Cohn, Trevor
Author_Institution :
Dept. of Comput. Sci., Univ. of Sheffield, Sheffield, UK
fYear :
2013
Firstpage :
8401
Lastpage :
8405
Abstract :
In spoken language translation, integration of the ASR and MT components is critical for good performance. In this paper, we consider the recognition setting where a text translation of each utterance is also available. We present experiments with different ASR system adaptation techniques to exploit MT system outputs. In particular, N-best MT outputs are represented as an utterance-specific language model, which are then used to rescore ASR lattices. We show that this method improves significantly over ASR alone, resulting in an absolute WER reduction of more than 6% for both indomain and out-of-domain acoustic models.
Keywords :
language translation; speech recognition; ASR components; ASR lattices; ASR system adaptation techniques; MT components; MT system; N-best MT outputs; WER reduction; in-domain acoustic models; lecture speech recognition system adaptation; machine translation output; out-of-domain acoustic models; recognition setting; spoken language translation; text translation; utterance-specific language model; Acoustics; Adaptation models; Data models; Interpolation; Speech; Speech recognition; Training; TED talks; language model adaptation; speech translation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6639304
Filename :
6639304
Link To Document :
بازگشت