DocumentCode :
3162967
Title :
Factorial Hidden Restricted Boltzmann Machines for noise robust speech recognition
Author :
Rennie, Steven J. ; Fousek, Petr ; Dognin, Pierre L.
Author_Institution :
IBM T.J. Watson Res. Center, Yorktown Heights, NY, USA
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
4297
Lastpage :
4300
Abstract :
We present the Factorial Hidden Restricted Boltzmann Machine (FHRBM) for robust speech recognition. Speech and noise are modeled as independent RBMs, and the interaction between them is explicitly modeled to capture how speech and noise combine to generate observed noisy speech features. In contrast with RBMs, where the bottom layer of random variables is observed, inference in the FHRBM is intractable, scaling exponentially with the number of hidden units. We introduce variational algorithms for efficient approximate inference that scale linearly with the number of hidden units. Compared to traditional factorial models of noisy speech, which are based on GMMs, the FHRBM has the advantage that the representations of both speech and noise are highly distributed, allowing the model to learn a parts-based representation of noisy speech data that can generalize better to previously unseen noise compositions. Preliminary results suggest that the approach is promising.
Keywords :
Boltzmann machines; Gaussian processes; inference mechanisms; source separation; speech recognition; FHRBM; GMM; Gaussian mixture model; factorial hidden restricted Boltzmann machines; independent RBM; inference; noise robust speech recognition; noisy speech data; noisy speech features; random variables; speech representation; variational algorithms; Acoustics; Hidden Markov models; Noise; Random variables; Robustness; Speech; Speech recognition; Deep Belief Networks; Restricted Boltzmann Machines; Robust Speech Recognition; Source Separation; Variational Methods;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6288869
Filename :
6288869
Link To Document :
بازگشت