DocumentCode :
1695120
Title :
Speed regularization and optimality in word classing
Author :
Zweig, Geoffrey ; Makarychev, Konstantin
Author_Institution :
Microsoft Res., Redmond, WA, USA
fYear :
2013
Firstpage :
8237
Lastpage :
8241
Abstract :
Word-classing has been used in language modeling for two distinct purposes: to improve the likelihood of the language model, and to improve the runtime speed. In particular, frequency-based heuristics have been proposed to improve the speed of recurrent neural network language models (RNN-LMs). In this paper, we present a dynamic programming algorithm for determining classes in a way that provably minimizes the runtime of the resulting class-based language models. However, we also find that the speed-based methods degrade the perplexity of the language models by 5-10% over traditional likelihood-based classing. We remedy this via the introduction of a speed-based regularization term in the likelihood objective function. This achieves a runtime close to that of the speed based methods without loss in perplexity performance. We demonstrate these improvements with both an RNN-LM and the Model M exponential language model, for three different tasks involving two different languages.
Keywords :
dynamic programming; natural languages; recurrent neural nets; class-based language models; dynamic programming algorithm; frequency-based heuristics; language modeling; likelihood objective function; likelihood-based classing; model M exponential language model; recurrent neural network language models; speed regularization; speed-based regularization term; word classing; Computational modeling; Electronic publishing; Encyclopedias; Linear programming; Runtime; Training; Language Modeling; Model M; Recurrent Neural Network; Word Classes;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6639271
Filename :
6639271
Link To Document :
بازگشت