مرکز منطقه ای اطلاع رساني علوم و فناوري - Towards knowledge-based features for HMM based large vocabulary automatic speech recognition

DocumentCode :

542305

Title :

Towards knowledge-based features for HMM based large vocabulary automatic speech recognition

Author :

Launay, Benoit ; Siohan, Olivier ; Surendran, Arun ; Lee, Chin-Hui

Author_Institution :

Multimedia Communications Research Lab, Bell Laboratories - Lucent Technologies, 600 Mountain Ave., Murray Hill, NJ 07974, USA

Volume :

fYear :

2002

fDate :

13-17 May 2002

Abstract :

This paper describes an attempt to design a knowledge-based large vocabulary speech recognition system. Our motivation is to replace features based on the short-term spectra, such as Mel-frequency cepstral coefficients (MFCC), by features that explicitly represent some of the distinctive features of the speech signal. However, rather than attempting to compute acoustic correlates of these distinctive features, we have engineered an approach where neural networks are trained to map short-term spectral features to the posterior probability of some distinctive features. These probabilities are then used as features in a large vocabulary tied-state HMM-based recognizer. Experimental results on the Wall Street Journal Task show that such a system, while not outperforming a MFCC-based system, generates very different error patterns. After combining the results of a base-line MFCC system with the results of several systems based on the proposed approach, we were able to obtain reductions in word error rates of 19% and 10 % on the 5K and 20K tasks respectively over our best MFCC-based systems.

Keywords :

Artificial neural networks; Computational modeling; Feature extraction; Hidden Markov models; Hoses; Markov processes; Training data;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on

Conference_Location :

Orlando, FL, USA

ISSN :

1520-6149

Print_ISBN :

0-7803-7402-9

Type :

conf

DOI :

10.1109/ICASSP.2002.5743864

Filename :

5743864

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=542305