DocumentCode :
3166677
Title :
Multi-objective optimization for semi-supervised discriminative language modeling
Author :
Kobayashi, Akio ; Oku, Takahiro ; Imai, Toru ; Nakagawa, Seiichi
Author_Institution :
NHK Sci. & Technol. Res. Labs., Tokyo, Japan
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
4997
Lastpage :
5000
Abstract :
A method for semi-supervised language modeling, which was designed to improve the robustness of a language model (LM) obtained from manually transcribed (labeled) data, is proposed. The LM is implemented as a log-linear model, which employs a set of linguistic features derived from word or phoneme n-grams. The proposed method is formulated as a multi-objective optimization programming problem (MOP), which consists of two objective functions based on expected risks for labeled lattices and automatic speech recognition (ASR) lattices as unlabeled training data. The model is trained in a discriminative manner and acquired as a solution to the problem. In transcribing Japanese broadcast programs, the proposed method reduced word error rate by 6.3% compared with that achieved by a conventional trigram LM.
Keywords :
optimisation; speech recognition; ASR lattices; Japanese broadcast programs; LM; MOP problem; automatic speech recognition lattices; discriminative manner; labeled lattices; linguistic features; log-linear model; multiobjective optimization; multiobjective optimization programming problem; semisupervised discriminative language modeling; Adaptation models; Data models; Lattices; Linear programming; Optimization; Speech; Training; Bayes risk minimization; discriminative training; language modeling; semi-supervised training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6289042
Filename :
6289042
Link To Document :
بازگشت