Title :
Topic dependent language modelling for spoken term detection
Author :
Kalantari, Shahram ; Dean, David ; Sridharan, Sridha ; Wallace, Richard
Author_Institution :
Speech Lab., Queensland Univ. of Technol., Brisbane, QLD, Australia
Abstract :
This paper investigates the effect of topic dependent language models (TDLM) on phonetic spoken term detection (STD) using dynamic match lattice spotting (DMLS). Phonetic STD consists of two steps: indexing and search. The accuracy of indexing audio segments into phone sequences using phone recognition methods directly affects the accuracy of the final STD system. If the topic of a document in known, recognizing the spoken words and indexing them to an intermediate representation is an easier task and consequently, detecting a search word in it will be more accurate and robust. In this paper, we propose the use of TDLMs in the indexing stage to improve the accuracy of STD in situations where the topic of the audio document is known in advance. It is shown that using TDLMs instead of the traditional general language model (GLM) improves STD performance according to figure of merit (FOM) criteria.
Keywords :
indexing; speech recognition; GLM; STD; TDLM; dynamic match lattice spotting; figure of merit criteria; general language model; indexing stage; phone recognition methods; phone sequences; search stage; spoken term detection; topic dependent language modelling; Accuracy; Hidden Markov models; Indexing; Lattices; Speech; Speech recognition; indexing; language modelling; spoken term detection;
Conference_Titel :
Signal Processing Conference (EUSIPCO), 2014 Proceedings of the 22nd European
Conference_Location :
Lisbon