DocumentCode :
730709
Title :
Learning feature mapping using deep neural network bottleneck features for distant large vocabulary speech recognition
Author :
Himawan, Ivan ; Motlicek, Petr ; Imseng, David ; Potard, Blaise ; Namhoon Kim ; Jaewon Lee
Author_Institution :
Idiap Res. Inst., Martigny, Switzerland
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
4540
Lastpage :
4544
Abstract :
Automatic speech recognition from distant microphones is difficult because recordings are affected by reverberation and background noise. First, the application of deep neural network (DNN)/hidden Markov model (HMM) hybrid acoustic models to the distant speech recognition task is investigated using the AMI meeting corpus. This paper then proposes a feature transformation that removes reverberation and background noise artefacts from bottleneck features, using a DNN trained to learn the mapping between distant-talking speech features and close-talking speech bottleneck features. Experimental results on the AMI meeting corpus reveal that the mismatch between close-talking and distant-talking conditions is largely reduced, with about 16% relative improvement over a conventional bottleneck system (trained on close-talking speech). If the feature mapping is applied to close-talking speech, a minor degradation of 4% relative is observed.
Keywords :
hidden Markov models; microphones; neural nets; reverberation; speech recognition; AMI meeting corpus; automatic speech recognition; background noise; close-talking speech bottleneck features; deep neural network bottleneck features; distant large vocabulary speech recognition; distant microphones; distant speech recognition task; distant-talking speech features; feature mapping; feature transformation; hidden Markov model; hybrid acoustic models; reverberation noise; Acoustics; Adaptation models; Feature extraction; Hidden Markov models; Speech; Speech recognition; Training; AMI corpus; Deep neural network; bottleneck features; distant speech recognition; meetings
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7178830
Filename :
7178830