مرکز منطقه ای اطلاع رساني علوم و فناوري - Isolated-utterance speech recognition using hidden Markov models with bounded state durations

DocumentCode :

1457129

Title :

Isolated-utterance speech recognition using hidden Markov models with bounded state durations

Author :

Gu, Hung-Yan ; Tseng, Chiu-Yu ; Lee, Lin-shan

Author_Institution :

Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ., Taipei, Taiwan

Volume :

Issue :

fYear :

1991

fDate :

8/1/1991 12:00:00 AM

Firstpage :

1743

Lastpage :

1752

Abstract :

Hidden Markov models (HMMs) with bounded state durations (HMM/BSD) are proposed to explicitly model the state durations of HMMs and more accurately consider the temporal structures existing in speech signals in a simple, direct, but effective way. A series of experiments have been conducted for speaker dependent applications using 408 highly confusing first-tone Mandarin syllables as the example vocabulary. It was found that in the discrete case the recognition rate of HMM/BSD (78.5%) is 9.0%, 6.3%, and 1.9% higher than the conventional HMMs and HMMs with Poisson and gamma distribution state durations, respectively. In the continuous case (partitioned Gaussian mixture modeling), the recognition rates of HMM/BSD (88.3% with 1 mixture, 88.8% with 3 mixtures, and 89.4% with 5 mixtures) are 6.3%, 5.0%, and 5.5% higher than those of the conventional HMMs, and 5.9% (with 1 mixture), 3.9% (with 3 mixtures) and 3.1% (with 1 mixture), 1.8% (with 3 mixtures) higher than HMMs with Poisson and gamma distributed state durations, respectively

Keywords :

Markov processes; speech recognition; HMM; Mandarin syllables; bounded state durations; hidden Markov models; isolated utterance recognition; speaker dependent applications; speech recognition; Computer science; Hidden Markov models; History; Parametric statistics; Phase estimation; Probability density function; Speech recognition; State estimation; Vector quantization; Vocabulary;

fLanguage :

English

Journal_Title :

Signal Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1053-587X

Type :

jour

DOI :

10.1109/78.91145

Filename :

91145

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1457129