Title :
Multi-objective optimization for semi-supervised discriminative language modeling
Author :
Kobayashi, Akio ; Oku, Takahiro ; Imai, Toru ; Nakagawa, Seiichi
Author_Institution :
NHK Sci. & Technol. Res. Labs., Tokyo, Japan
Abstract :
A method for semi-supervised language modeling, which was designed to improve the robustness of a language model (LM) obtained from manually transcribed (labeled) data, is proposed. The LM is implemented as a log-linear model, which employs a set of linguistic features derived from word or phoneme n-grams. The proposed method is formulated as a multi-objective optimization programming problem (MOP), which consists of two objective functions based on expected risks for labeled lattices and automatic speech recognition (ASR) lattices as unlabeled training data. The model is trained in a discriminative manner and acquired as a solution to the problem. In transcribing Japanese broadcast programs, the proposed method reduced word error rate by 6.3% compared with that achieved by a conventional trigram LM.
Keywords :
optimisation; speech recognition; ASR lattices; Japanese broadcast programs; LM; MOP problem; automatic speech recognition lattices; discriminative manner; labeled lattices; linguistic features; log-linear model; multiobjective optimization; multiobjective optimization programming problem; semisupervised discriminative language modeling; Adaptation models; Data models; Lattices; Linear programming; Optimization; Speech; Training; Bayes risk minimization; discriminative training; language modeling; semi-supervised training;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6289042