Document-specific context plsa language model for speech recognition

Author

Haidar, Md Akmal ; O´Shaughnessy, Douglas

Author_Institution

INRS-EMT, Montreal, QC, Canada

fYear

2015

fDate

19-24 April 2015

Firstpage

5326

Lastpage

5330

Abstract

In this paper, we introduce a document-specific context probabilistic latent semantic analysis (DCPLSA) model for speech recognition. This is an extension of a CPLSA model [1] where the probability of word is conditioned only on topics. The CPLSA model uses the bigram counts that are the number of appearances of the bigrams in the corpus. These counts are the sum of the bigram counts in different documents where they could appear to describe different topics. We encounter this problem in the CPLSA model and introduce the document-specific CPLSA model (DCPLSA) where the probability of a word is conditioned on both topic and document. We carried out experiments on a continuous speech recognition (CSR) task using the Wall Street Journal (WSJ) corpus and have seen that the proposed DCPLSA approach yields significant reduction in both perplexity and word error rate (WER) measurements over the other approaches used in the literature.

Keywords

probability; speech recognition; Wall Street Journal corpus; bigram counts; continuous speech recognition; document-specific context PLSA language model; document-specific context probabilistic latent semantic analysis model; word error rate; Adaptation models; Computational modeling; Context; Context modeling; Mathematical model; Speech recognition; Training; Topic models; bigram PLSA models; context-based PLSA language model; speech recognition; statistical language model;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on

Conference_Location

South Brisbane, QLD

Type

conf

DOI

10.1109/ICASSP.2015.7178988

Filename

7178988