مرکز منطقه ای اطلاع رساني علوم و فناوري - Discriminative language model adaptation for Mandarin broadcast speech transcription and translation

DocumentCode :

2768882

Title :

Discriminative language model adaptation for Mandarin broadcast speech transcription and translation

Author :

Liu, X.A. ; Byrne, W.J. ; Gales, M.J.F. ; de Gispert, A. ; Tomalin, M. ; Woodland, P.C. ; Yu, K.

Author_Institution :

Cambridge Univ., Cambridge

fYear :

2007

fDate :

9-13 Dec. 2007

Firstpage :

153

Lastpage :

158

Abstract :

This paper investigates unsupervised test-time adaptation of language models (LM) using discriminative methods for a Mandarin broadcast speech transcription and translation task. A standard approach to adapt interpolated language models to is to optimize the component weights by minimizing the perplexity on supervision data. This is a widely made approximation for language modeling in automatic speech recognition (ASR) systems. For speech translation tasks, it is unclear whether a strong correlation still exists between perplexity and various forms of error cost functions in recognition and translation stages. The proposed minimum Bayes risk (MBR) based approach provides a flexible framework for unsupervised LM adaptation. It generalizes to a variety of forms of recognition and translation error metrics. LM adaptation is performed at the audio document level using either the character error rate (CER), or translation edit rate (TER) as the cost function. An efficient parameter estimation scheme using the extended Baum-Welch (EBW) algorithm is proposed. Experimental results on a state-of-the-art speech recognition and translation system are presented. The MBR adapted language models gave the best recognition and translation performance and reduced the TER score by up to 0.54% absolute.

Keywords :

Bayes methods; approximation theory; error statistics; estimation theory; interpolation; language translation; natural languages; speech recognition; Mandarin broadcast speech transcription; approximation theory; automatic speech recognition; character error rate; discriminative language model adaptation; error cost function; extended Baum-Welch algorithm; interpolation; minimum Bayes risk; parameter estimation; statistical machine translation; translation edit rate; unsupervised test-time adaptation; Adaptation model; Automatic speech recognition; Broadcasting; Cost function; Error analysis; Interpolation; Natural languages; Optimization methods; Speech recognition; Surface-mount technology; discriminative training; language model adaptation; speech recognition and translation;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on

Conference_Location :

Kyoto

Print_ISBN :

978-1-4244-1746-9

Electronic_ISBN :

978-1-4244-1746-9

Type :

conf

DOI :

10.1109/ASRU.2007.4430101

Filename :

4430101

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2768882