DocumentCode :
730672
Title :
Multi-basis adaptive neural network for rapid adaptation in speech recognition
Author :
Chunyang Wu ; Gales, Mark J. F.
Author_Institution :
Eng. Dept., Cambridge Univ., Cambridge, UK
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
4315
Lastpage :
4319
Abstract :
Recent progress in acoustic modeling with deep neural network has significantly improved the performance of automatic speech recognition systems. However, it remains as an open problem how to rapidly adapt these networks with limited, unsupervised, data. Most existing methods to adapt a neural network involve modifying a large number of parameters thus rapid adaptation is not possible with these schemes. In this paper, the multi-basis adaptive neural network is proposed, a new neural network configuration which only requires very few parameters for adaptation. By modifying the topology of a single multi-layer perception, a set of sub-networks with restricted connectivity are introduced to collaboratively capture different acoustic properties. The outputs of those sub-networks are combined by speaker-dependent interpolation weights. In addition, the complete system can be optimized in an adaptive training fashion when non-homogeneous training data are used. The performance of unsupervised adaptation is evaluated on two datasets. It outperforms the speaker-independent hybrid DNN-HMM baseline both on the Broadcast News English and the AURORA-4 tasks.
Keywords :
acoustic signal processing; adaptive signal processing; interpolation; multilayer perceptrons; speech recognition; acoustic modeling; automatic speech recognition systems; deep neural network; multibasis adaptive neural network; multilayer perception; neural network configuration; nonhomogeneous training data; speaker dependent interpolation weight; speaker-independent hybrid DNN-HMM; Acoustics; Adaptation models; Hidden Markov models; Neural networks; Silicon; Speech; Training; Adaptation; deep neural network; speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7178785
Filename :
7178785
Link To Document :
بازگشت