مرکز منطقه ای اطلاع رساني علوم و فناوري - Adaptation of lecture speech recognition system with machine translation output

DocumentCode :

1695957

Title :

Adaptation of lecture speech recognition system with machine translation output

Author :

Ng, Raymond W. M. ; Hain, Thomas ; Cohn, Trevor

Author_Institution :

Dept. of Comput. Sci., Univ. of Sheffield, Sheffield, UK

fYear :

2013

Firstpage :

8401

Lastpage :

8405

Abstract :

In spoken language translation, integration of the ASR and MT components is critical for good performance. In this paper, we consider the recognition setting where a text translation of each utterance is also available. We present experiments with different ASR system adaptation techniques to exploit MT system outputs. In particular, N-best MT outputs are represented as an utterance-specific language model, which are then used to rescore ASR lattices. We show that this method improves significantly over ASR alone, resulting in an absolute WER reduction of more than 6% for both indomain and out-of-domain acoustic models.

Keywords :

language translation; speech recognition; ASR components; ASR lattices; ASR system adaptation techniques; MT components; MT system; N-best MT outputs; WER reduction; in-domain acoustic models; lecture speech recognition system adaptation; machine translation output; out-of-domain acoustic models; recognition setting; spoken language translation; text translation; utterance-specific language model; Acoustics; Adaptation models; Data models; Interpolation; Speech; Speech recognition; Training; TED talks; language model adaptation; speech translation;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on

Conference_Location :

Vancouver, BC

ISSN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2013.6639304

Filename :

6639304

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1695957