مرکز منطقه ای اطلاع رساني علوم و فناوري - An improved HMM/VQ training procedure for speaker-independent isolated word recognition

DocumentCode :

2286830

Title :

An improved HMM/VQ training procedure for speaker-independent isolated word recognition

Author :

Zhang, Yaxin ; Alder, Mike

Author_Institution :

Dept. of Electr. & Electron. Eng., Western Australia Univ., Nedlands, WA, Australia

fYear :

1994

fDate :

13-16 Apr 1994

Firstpage :

722

Abstract :

This paper describe an improved training procedure in a HMM/VQ speech recognition system for speaker-independent speech recognition. The phoneme based Gaussian mixture models (GMM) were generated in the first step modeling using the Expectation-Maximization (EM) algorithm. These Gaussians more accurately describe the distribution characteristic of the phonemes in the speech signal space. Therefore better first step modeling is achieved and the performance of the whole recognition system is improved. The new method was used in a speaker-independent isolated digits and phoneme recognition tasks. Two English databases were used for the training and testing. Significant improvements have been achieved in comparison with the conventional HMM/VQ system

Keywords :

hidden Markov models; speech recognition; stochastic processes; vector quantisation; English databases; HMM/VQ training procedure; distribution characteristic; expectation-maximization algorithm; phoneme based Gaussian mixture models; speaker-independent isolated word recognition; speech signal space; Books; Clustering algorithms; Hidden Markov models; Image coding; Image recognition; Signal generators; Signal processing; Signal processing algorithms; Speech recognition; Vector quantization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Speech, Image Processing and Neural Networks, 1994. Proceedings, ISSIPNN '94., 1994 International Symposium on

Print_ISBN :

0-7803-1865-X

Type :

conf

DOI :

10.1109/SIPNN.1994.344810

Filename :

344810

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2286830