Title :
Optimising Figure of Merit for phonetic spoken term detection
Author :
Wallace, Roy ; Vogt, Robbie ; Baker, Brendan ; Sridharan, Sridha
Author_Institution :
Speech & Audio Res. Lab., Queensland Univ. of Technol. (QUT), Brisbane, QLD, Australia
Abstract :
This paper introduces a novel technique to directly optimise the Figure of Merit (FOM) for phonetic spoken term detection. The FOM is a popular measure of STD accuracy, making it an ideal candidate for use as an objective function. A simple linear model is introduced to transform the phone log-posterior probabilities output by a phone classifier to produce enhanced log-posterior features that are more suitable for the STD task. Direct optimisation of the FOM is then performed by training the parameters of this model using a nonlinear gradient descent algorithm. Substantial FOM improvements of 11% relative are achieved on held-out evaluation data, demonstrating the generalisability of the approach.
Keywords :
probability; speech processing; speech recognition; figure of merit; phone log-posterior probabilities output; phonetic spoken term detection; simple linear model; speech recognition; Australia; Decoding; Error analysis; Indexing; Information retrieval; Laboratories; Magneto electrical resistivity imaging technique; Speech processing; Speech recognition; Viterbi algorithm; information retrieval; speech processing; speech recognition; spoken term detection;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5494969