DocumentCode :
1686353
Title :
Gaussian-Bernoulli restricted Boltzmann machines and automatic feature extraction for noise robust missing data mask estimation
Author :
Keronen, Sami ; KyungHyun Cho ; Raiko, Tapani ; Ilin, Alexander ; Palomaki, Kalle
Author_Institution :
Sch. of Sci., Dept. of Inf. & Comput. Sci., Aalto Univ., Aalto, Finland
fYear :
2013
Firstpage :
6729
Lastpage :
6733
Abstract :
A missing data mask estimation method based on Gaussian-Bernoulli restricted Boltzmann machine (GRBM) trained on cross-correlation representation of the audio signal is presented in the study. The automatically learned features by the GRBM are utilized in dividing the time-frequency units of the spectrographic mask into noise and speech dominant. The system is evaluated against two baseline mask estimation methods in a reverberant multisource environment speech recognition task. The proposed system is shown to provide a performance improvement in the speech recognition accuracy over the previous multifeature approaches.
Keywords :
Boltzmann machines; audio signal processing; data handling; feature extraction; spectroscopy; speech recognition; GRBM; Gaussian-Bernoulli restricted Boltzmann machines; audio signal; automatic feature extraction; baseline mask estimation methods; cross-correlation representation; multifeature approaches; noise robust missing data mask estimation; reverberant multisource environment speech recognition task; spectrographic mask; time-frequency units; Estimation; Feature extraction; Noise; Speech; Speech recognition; Support vector machines; Vectors; GRBM; Noise robust; deep learning; mask estimation; speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6638964
Filename :
6638964
Link To Document :
بازگشت