Title :
PAC-Bayesian approach for minimization of phoneme error rate
Author :
Keshet, Joseph ; McAllester, David ; Hazan, Tamir
Author_Institution :
Toyota Technol. Inst. at Chicago, Chicago, IL, USA
Abstract :
We describe a new approach for phoneme recognition which aims at minimizing the phoneme error rate. Building on structured prediction techniques, we formulate the phoneme recognizer as a linear combination of feature functions. We state a PAC-Bayesian generalization bound, which gives an upper-bound on the expected phoneme error rate in terms of the empirical phoneme error rate. Our algorithm is derived by finding the gradient of the PAC-Bayesian bound and minimizing it by stochastic gradient descent. The resulting algorithm is iterative and easy to implement. Experiments on the TIMIT corpus show that our method achieves the lowest phoneme error rate compared to other discriminative and generative models with the same expressive power.
Keywords :
Bayes methods; gradient methods; speech recognition; stochastic processes; PAC-Bayesian generalization bound; TIMIT corpus; feature function; iterative algorithm; phoneme error rate minimization; phoneme recognition; phoneme recognizer; stochastic gradient descent; structured prediction technique; Acoustics; Error analysis; Hidden Markov models; Kernel; Speech; Speech recognition; Training; PAC-Bayesian theorem; discriminative training; kernels; phoneme recognition; structured prediction;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5946923