DocumentCode :
3020724
Title :
HMM-Based speech recognition using multi-dimensional multi-labeling
Author :
Nishimura, Masafumi ; Toshioka, Koichi
Author_Institution :
Tokyo Research Laboratory, IBM Japan Ltd., Tokyo, Japan
Volume :
12
fYear :
1987
fDate :
31868
Firstpage :
1163
Lastpage :
1166
Abstract :
This paper describes a new vector quantization (VQ; so-called labeling) method of a speech recognition system based on hidden Markov model (HMM). For improving the VQ accuracy in a simple manner, "multi-labeling" which generates multiple labels at each frame was introduced while keeping a conventional HMM formulation. Furthermore, in order to represent characteristics of speech accurately and effectively, "multi-dimensional labeling" was also introduced which quantizes multiple features such as spectral dynamics and spectrum independently. This labeling method was tested in an isolated word recognition task using 150 Japanese confusable words. The recognition error rate was roughly reduced to 1/2 or less compared with the conventional method.
Keywords :
Cognition; Density functional theory; Error analysis; Fluctuations; Hidden Markov models; Labeling; Laboratories; Speech recognition; Testing; Vector quantization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '87.
Type :
conf
DOI :
10.1109/ICASSP.1987.1169883
Filename :
1169883
Link To Document :
بازگشت