Title :
Language model adaptation using WFST-based speaking-style translation
Author :
Hori, Takaaki ; Willett, Daniel ; Minami, Yasuhiro
Author_Institution :
Speech Open Lab., NTT Commun. Sci. Labs., Kyoto, Japan
Abstract :
This paper describes a new approach to language model adaptation for speech recognition based on the statistical framework of speech translation. The main idea of this approach is to compose a weighted finite-state transducer (WFST) that translates sentence styles from in-domain to out-of-domain. It enables to integrate language models of different styles of speaking or dialects and even of different vocabularies. The WFST is built by combining in-domain and out-of-domain models through the translation, while each model and the translation itself is expressed as a WFST. We apply this technique to building language models for spontaneous speech recognition using large written-style corpora. We conducted experiments on a 20k-word Japanese spontaneous speech recognition task. With a small in-domain corpus, a 2.9% absolute improvement in word error rate is achieved over the in-domain model.
Keywords :
linguistics; natural languages; speech recognition; Japanese spontaneous speech recognition task; WFST; dialects; in-domain models; language model adaptation; out-of-domain models; sentence styles; speech recognition; speech translation; statistical framework; vocabularies; weighted finite-state transducer; Acoustic transducers; Adaptation model; Context modeling; Interpolation; Laboratories; Natural languages; Speech recognition; Statistics; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
Print_ISBN :
0-7803-7663-3
DOI :
10.1109/ICASSP.2003.1198759