مرکز منطقه ای اطلاع رساني علوم و فناوري - Language model adaptation using WFST-based speaking-style translation

DocumentCode :

394235

Title :

Language model adaptation using WFST-based speaking-style translation

Author :

Hori, Takaaki ; Willett, Daniel ; Minami, Yasuhiro

Author_Institution :

Speech Open Lab., NTT Commun. Sci. Labs., Kyoto, Japan

Volume :

fYear :

2003

fDate :

6-10 April 2003

Abstract :

This paper describes a new approach to language model adaptation for speech recognition based on the statistical framework of speech translation. The main idea of this approach is to compose a weighted finite-state transducer (WFST) that translates sentence styles from in-domain to out-of-domain. It enables to integrate language models of different styles of speaking or dialects and even of different vocabularies. The WFST is built by combining in-domain and out-of-domain models through the translation, while each model and the translation itself is expressed as a WFST. We apply this technique to building language models for spontaneous speech recognition using large written-style corpora. We conducted experiments on a 20k-word Japanese spontaneous speech recognition task. With a small in-domain corpus, a 2.9% absolute improvement in word error rate is achieved over the in-domain model.

Keywords :

linguistics; natural languages; speech recognition; Japanese spontaneous speech recognition task; WFST; dialects; in-domain models; language model adaptation; out-of-domain models; sentence styles; speech recognition; speech translation; statistical framework; vocabularies; weighted finite-state transducer; Acoustic transducers; Adaptation model; Context modeling; Interpolation; Laboratories; Natural languages; Speech recognition; Statistics; Vocabulary;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on

ISSN :

1520-6149

Print_ISBN :

0-7803-7663-3

Type :

conf

DOI :

10.1109/ICASSP.2003.1198759

Filename :

1198759

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=394235