DocumentCode :
1843261
Title :
Unsupervised and semi-supervised adaptation of a hybrid speech recognition system
Author :
Trmal, Jan ; Zelinka, J. ; Muller, Lukas
Author_Institution :
Dept. of Cybern., Univ. of West Bohemia, Pilsen, Czech Republic
Volume :
1
fYear :
2012
fDate :
21-25 Oct. 2012
Firstpage :
527
Lastpage :
530
Abstract :
This paper evaluates a recently published method for supervised and unsupervised adaptation of neural networks used in hybrid speech recognition systems. The neural networks used in the field of hybrid speech recognition have certain distinct characteristics that make the usual adaptation methods (such as retraining the neural network) unusable or ineffective. The recently published MELT (Minimum Error Linear Transform) method [1] has been developed to cope with this issue. By providing a way of establishing a link between the intermediate features and the long temporal features, the number of free variables can be reduced significantly and the resulting adaptation parameters can be estimated robustly. The experiments were performed on the WSJCAM0 speech corpus. Contrary to the original paper [1], the experiments were performed using a word recognizer instead of a phoneme recognizer. The experimental results suggest that the MELT method can be used both in an unsupervised as well as a semi-supervised manner and when applied, it leads to significant reduction of word error rate, even for a strong language model.
Keywords :
neural nets; speech recognition; transforms; MELT method; WSJCAM0 speech corpus; hybrid speech recognition system; intermediate features; language model; minimum error linear transform method; neural networks; phoneme recognizer; semisupervised adaptation; temporal features; unsupervised adaptation; word recognizer; MELT; Neural Networks; Speaker Adaptation; Speech Recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing (ICSP), 2012 IEEE 11th International Conference on
Conference_Location :
Beijing
ISSN :
2164-5221
Print_ISBN :
978-1-4673-2196-9
Type :
conf
DOI :
10.1109/ICoSP.2012.6491542
Filename :
6491542
Link To Document :
بازگشت