Title :
Optimization of Neural Network Language Models for keyword search
Author :
Gandhe, Ankur ; Metze, Florian ; Waibel, Alex ; Lane, Ian
Author_Institution :
Language Technol. Inst., Carnegie Mellon Univ., Pittsburgh, PA, USA
Abstract :
Recent works have shown Neural Network based Language Models (NNLMs) to be an effective modeling technique for Automatic Speech Recognition. Prior works have shown that these models obtain lower perplexity and word error rate (WER) compared to both standard n-gram language models (LMs) and more advanced language models including maximum entropy and random forest LMs. While these results are compelling, prior works were limited to evaluating NNLMs on perplexity and word error rate. Our initial results showed that while NNLMs improved speech recognition accuracy, the improvement in keyword search was negligible. In this paper we propose alternate optimizations of NNLMs for the task of keyword search. We evaluate the performance of the proposed methods for keyword search on the Vietnamese dataset provided in phase one of the BABEL1 project and demonstrate that by penalizing low frequency words during NNLM training, keyword search metrics such as actual term weighted value (ATWV) can be improved by up to 9.3% compared to the standard training methods.
Keywords :
information retrieval; learning (artificial intelligence); neural nets; word processing; ATWV; BABEL project; NNLM training; Vietnamese dataset; actual term weighted value; keyword search metrics; neural network language model optimization; Artificial neural networks; Computational modeling; History; Keyword search; Speech recognition; Training; keyword search; language modeling; neural networks;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6854531