Regional Information Center for Science and Technology (RICeST) - On including temporal constraints in Viterbi alignment for speech recognition in noise

DocumentCode :

1437316

Title :

On including temporal constraints in Viterbi alignment for speech recognition in noise

Author :

Yoma, Nestor Becerra ; McInnes, Fergus R. ; Jack, Mervyn A. ; Stump, Sandra Dotto ; Ling, Lee Luan

Author_Institution :

Dept. of Electr. Eng., Chile Univ., Santiago, Chile

Volume :

Issue :

fYear :

2001

fDate :

2/1/2001

Firstpage :

179

Lastpage :

182

Abstract :

This paper addresses the problem of temporal constraints in the Viterbi algorithm in speaker-dependent and speaker-independent tasks. The results presented here suggest that in a speaker-dependent task the introduction of temporal constraints can lead to a large improvement under additive or convolutional noise, that the statistical modeling of state durations is not relevant if maximum and minimum state duration restrictions are imposed, and that truncated probability densities give better results than a previously proposed metric. Finally, word-position-dependent and word-position-independent temporal restrictions are compared in connected-word speech recognition experiments, and the former is shown to lead to better results with the same computational load. However, the effect of the duration model could be much less significant when the acoustic model is optimized and when the training and testing conditions are matched.

Keywords :

noise; probability; speech recognition; statistical analysis; Viterbi algorithm; Viterbi alignment; acoustic model; additive noise; computational load; connected word speech recognition experiments; convolutional noise; duration model effect; max state duration; min state duration; speaker-independent task; speaker-dependent task; state durations; statistical modeling; temporal constraints; testing conditions; training conditions; truncated probability densities; word position dependent temporal restrictions; word position independent temporal restrictions; Acoustic testing; Convolution; Hidden Markov models; Lifting equipment; Load modeling; Topology

fLanguage :

English

Journal_Title :

Speech and Audio Processing, IEEE Transactions on

Publisher :

IEEE

ISSN :

1063-6676

Type :

jour

DOI :

10.1109/89.902285

Filename :

902285

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1437316