DocumentCode :
1686317
Title :
MLP-based factor analysis for tandem speech recognition
Author :
Ferras, Marc ; Bourlard, Herve
Author_Institution :
Idiap Res. Inst., Martigny, Switzerland
fYear :
2013
Firstpage :
6719
Lastpage :
6723
Abstract :
In the last years, latent variable models such as factor analysis, probabilistic principal component analysis or subspace Gaussian mixture models have become almost ubiquitous in speech technologies. The key to its success is the joint modeling of multiple effects in the speech signal they address. In this paper, we propose a novel approach to use phone and speaker variabilities together to estimate phone posterior probabilities on a tandem speech recognition system. A Multilayer Perceptron (MLP) with 5 layers and a central bottleneck linear layer is used as a basic processing block that mimics the processing undergone in factor analysis. With multiple factors, phone and a speaker MLP are merged at the bottleneck level to obtain better estimates for the phone posterior probabilities used in the ASR system. Experiments on the WSJ corpus show that the joint phone-speaker modeling can significantly outperform phone modeling alone in terms of Frame Error and Word Error Rates.
Keywords :
Gaussian processes; multilayer perceptrons; principal component analysis; speech recognition; ASR system; MLP-based factor analysis; frame error rate; joint phone-speaker modeling; multilayer perceptron; phone posterior probability; probabilistic principal component analysis; speaker MLP; speech signal; speech technology; subspace Gaussian mixture model; tandem speech recognition system; word error rate; Adaptation models; Analytical models; Hidden Markov models; Speech; Training; Training data; Vectors; factor analysis; multilayer perceptron; neural network; tandem speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6638962
Filename :
6638962
Link To Document :
بازگشت