DocumentCode :
2791416
Title :
Multi-style MLP features for BN transcription
Author :
Le, Viet-Bac ; Lamel, Lori ; Gauvain, Jean-Luc
Author_Institution :
Spoken Language Process. Group, LIMSI-CNRS, Orsay, France
fYear :
2010
fDate :
14-19 March 2010
Firstpage :
4866
Lastpage :
4869
Abstract :
It has become common practice to adapt acoustic models to specific-conditions (gender, accent, bandwidth) in order to improve the performance of speech-to-text (STT) transcription systems. With the growing interest in the use of discriminative features produced by a multi layer perceptron (MLP) in such systems, the question arise of whether it is necessary to specialize the MLP to particular conditions, and if so, how to incorporate the condition-specific MLP features in the system. This paper explores three approaches (adaptation, full training, and feature merging) to use condition-specific MLP features in a state-of-the-art BN STT system for French. The third approach without condition-specific adaptation was found to outperform the original models with condition-specific adaptation, and was found to perform almost as well as full training of multiple condition-specific HMMs.
Keywords :
feature extraction; multilayer perceptrons; speech recognition; acoustic model; condition-specific adaptation; discriminative feature; multi layer perception; multistyle MLP feature; speech to text transcription system; Adaptation model; Bandwidth; Broadcasting; Cepstral analysis; Computer architecture; Hidden Markov models; Merging; Natural languages; Speech; Training data; BN transcription; MLP features; condition-specific adaptation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
ISSN :
1520-6149
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2010.5495116
Filename :
5495116
Link To Document :
بازگشت