مرکز منطقه ای اطلاع رساني علوم و فناوري - Phoneme HMMs constrained by frame correlations

DocumentCode :

2020452

Title :

Phoneme HMMs constrained by frame correlations

Author :

Takahashi, Satoshi ; Matsuoka, Tatsuo ; Minami, Yasuhiro ; Shikano, Kiyohiro

Author_Institution :

NTT Human Interface Lab., Tokyo, Japan

Volume :

fYear :

1993

fDate :

27-30 April 1993

Firstpage :

219

Abstract :

Phoneme HMMs (hidden Markov models) that use correlations between two frames are proposed. The proposed technique constrains the output probability distributions of speaker-independent HMMs so that they are suitable for the input speaker. The speaker-dependent BC (bigram-constrained)-HMMs and speaker-independent BC-HMMs are generated from the conventional speaker-independent HMMs by combining the VQ (vector quantization)-code bigram (discrete case and tied-mixture case) or the conditional Gaussian density function (continuous case). The new models were evaluated by 23-phoneme recognition in continuous speech. In the speaker-dependent BC-HMMs, which use the speaker-dependent bigram created by 50 additional sentences of the test speaker, the best recognition accuracy of 74.8% was obtained by the tied-mixture type BC-HMMs. In the speaker-independent BC-HMMs, the best recognition accuracy of 67.5% was obtained by the continuous type BC-HMMs.<>

Keywords :

constraint handling; correlation methods; hidden Markov models; speech recognition; vector quantisation; conditional Gaussian density function; correlations between two frames; hidden Markov models; output probability distributions; recognition accuracy; recognition in continuous speech; vector quantization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on

Conference_Location :

Minneapolis, MN, USA

ISSN :

1520-6149

Print_ISBN :

0-7803-7402-9

Type :

conf

DOI :

10.1109/ICASSP.1993.319274

Filename :

319274

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2020452