DocumentCode :
542305
Title :
Towards knowledge-based features for HMM based large vocabulary automatic speech recognition
Author :
Launay, Benoit ; Siohan, Olivier ; Surendran, Arun ; Lee, Chin-Hui
Author_Institution :
Multimedia Communications Research Lab, Bell Laboratories - Lucent Technologies, 600 Mountain Ave., Murray Hill, NJ 07974, USA
Volume :
1
fYear :
2002
fDate :
13-17 May 2002
Abstract :
This paper describes an attempt to design a knowledge-based large vocabulary speech recognition system. Our motivation is to replace features based on the short-term spectra, such as Mel-frequency cepstral coefficients (MFCC), by features that explicitly represent some of the distinctive features of the speech signal. However, rather than attempting to compute acoustic correlates of these distinctive features, we have engineered an approach where neural networks are trained to map short-term spectral features to the posterior probability of some distinctive features. These probabilities are then used as features in a large vocabulary tied-state HMM-based recognizer. Experimental results on the Wall Street Journal Task show that such a system, while not outperforming a MFCC-based system, generates very different error patterns. After combining the results of a base-line MFCC system with the results of several systems based on the proposed approach, we were able to obtain reductions in word error rates of 19% and 10 % on the 5K and 20K tasks respectively over our best MFCC-based systems.
Keywords :
Artificial neural networks; Computational modeling; Feature extraction; Hidden Markov models; Hoses; Markov processes; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.2002.5743864
Filename :
5743864
Link To Document :
بازگشت