Title :
Training and adapting MLP features for Arabic speech recognition
Author :
Park, J. ; Diehl, F. ; Gales, M.J.F. ; Tomalin, M. ; Woodland, P.C.
Author_Institution :
Eng. Dept., Cambridge Univ., Cambridge
Abstract :
Features derived from multilayer perceptrons (MLPs) are becoming increasingly popular for speech recognition. This paper describes various schemes for applying these features to state-of-the-art Arabic speech recognition: the use of MLP-features for short-vowel modelling in graphemic systems; rapid discriminative model training by standard PLP feature lattice reuse; and MLP feature adaptation using linear input networks (LIN). The use of rapid training using MLP features and their use for short-vowel modelling and LIN adaptation gave reductions in word error rate. However significant improvements over explicit short-vowel modelling with standard multi-pass adaptation were not obtained, although they were useful in combination.
Keywords :
learning (artificial intelligence); multilayer perceptrons; natural languages; speech recognition; Arabic speech recognition; MLP feature training; graphemic system; linear input network; multilayer perceptron; short vowel modelling; word error rate; Adaptation model; Dictionaries; Error analysis; Hidden Markov models; Lattices; Loudspeakers; Multilayer perceptrons; Speech recognition; Training data; Vocabulary; Acoustic Modelling; Arabic Speech Recognition; Multi-Layer Perceptron; Speaker Adaptation;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2009.4960620