Title :
A confidence-based approach for improving keyword hypothesis scores
Author :
Seigel, M.S. ; Woodland, Philip C. ; Gales, Mark J.F.
Author_Institution :
Eng. Dept., Cambridge Univ., Cambridge, UK
Abstract :
The task in keyword spotting (KWS) is to hypothesise times at which any of a set of key terms occurs in audio. An important aspect of such systems are the scores assigned to these hypotheses, the accuracy of which have a significant impact on performance. Estimating these scores may be formulated as a confidence estimation problem, where a measure of confidence is assigned to each key term hypothesis. In this work, a set of discriminative features is defined, and combined using a conditional random field (CRF) model for improved confidence estimation. An extension to this model to directly address the problem of score normalisation across key terms is also introduced. The implicit score normalisation which results from applying this approach to separate systems in a hybrid configuration yields further benefits. Results are presented which show notable improvements in KWS performance using the techniques presented in this work.
Keywords :
estimation theory; random processes; speech processing; CRF model; KWS performance; audio; conditional random field; confidence based approach; confidence estimation problem; confidence measure; discriminative features; hypothesise times; implicit score normalisation; keyword hypothesis scores; keyword spotting; Context; Estimation; Feature extraction; Lattices; Speech; Training; Vocabulary; conditional random fields; confidence estimation; keyword spotting; spoken term detection;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639337