Regional Information Center for Science and Technology (RICeST) - HMM-Based speech recognition using multi-dimensional multi-labeling

DocumentCode :

3020724

Title :

HMM-Based speech recognition using multi-dimensional multi-labeling

Author :

Nishimura, Masafumi ; Toshioka, Koichi

Author_Institution :

Tokyo Research Laboratory, IBM Japan Ltd., Tokyo, Japan

Volume :

fYear :

1987

fDate :

April 1987 (serial date 31868)

Firstpage :

1163

Lastpage :

1166

Abstract :

This paper describes a new vector quantization (VQ, so-called labeling) method for a speech recognition system based on hidden Markov models (HMMs). To improve VQ accuracy in a simple manner, "multi-labeling," which generates multiple labels at each frame, is introduced while keeping the conventional HMM formulation. Furthermore, to represent the characteristics of speech accurately and effectively, "multi-dimensional labeling," which quantizes multiple features such as the spectrum and spectral dynamics independently, is also introduced. The labeling method was tested on an isolated word recognition task using 150 confusable Japanese words. The recognition error rate was reduced to roughly one half or less of that of the conventional method.
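
The two labeling ideas named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no algorithmic detail, so the function names, the inverse-distance weighting, the choice of `k`, and the toy codebook values are all assumptions made for illustration. Multi-labeling is shown as keeping the `k` nearest codewords per frame instead of one, and multi-dimensional labeling as quantizing each feature stream against its own codebook.

```python
import math


def multi_label(frame, codebook, k=2):
    """Return the k nearest codeword indices with normalized weights.

    Inverse-distance weighting is an illustrative assumption; the
    abstract does not say how the multiple labels are weighted.
    """
    # Euclidean distance from the frame to every codeword, nearest first.
    dists = sorted((math.dist(frame, cw), idx) for idx, cw in enumerate(codebook))
    nearest = dists[:k]
    # Small epsilon avoids division by zero when a frame hits a codeword exactly.
    inv = [1.0 / (d + 1e-9) for d, _ in nearest]
    total = sum(inv)
    return [(idx, w / total) for (_, idx), w in zip(nearest, inv)]


def multi_dimensional_multi_label(features, codebooks, k=2):
    """Quantize each feature stream independently with its own codebook."""
    return {name: multi_label(features[name], codebooks[name], k)
            for name in codebooks}


# Hypothetical 2-D codebooks for two feature streams (made-up values).
codebooks = {
    "spectrum": [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)],
    "dynamics": [(-1.0, 0.0), (1.0, 0.0)],
}
frame = {"spectrum": (0.9, 0.1), "dynamics": (0.2, 0.0)}
labels = multi_dimensional_multi_label(frame, codebooks, k=2)
```

Each stream yields a short list of `(label, weight)` pairs per frame. How those pairs enter the HMM output probabilities is left out here, since the abstract states only that the conventional HMM formulation is kept.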

Keywords :

Cognition; Density functional theory; Error analysis; Fluctuations; Hidden Markov models; Labeling; Laboratories; Speech recognition; Testing; Vector quantization;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '87.

Type :

conf

DOI :

10.1109/ICASSP.1987.1169883

Filename :

1169883

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3020724