مرکز منطقه ای اطلاع رساني علوم و فناوري - N-channel hidden Markov models for combined stressed speech classification and recognition

DocumentCode :

1257626

Title :

N-channel hidden Markov models for combined stressed speech classification and recognition

Author :

Womack, Brian David ; Hansen, John H L

Author_Institution :

Robust Speech Process. Lab., Colorado Univ., Boulder, CO, USA

Volume :

Issue :

fYear :

1999

fDate :

11/1/1999 12:00:00 AM

Firstpage :

668

Lastpage :

677

Abstract :

Robust speech recognition systems must address variations due to perceptually induced stress in order to maintain acceptable levels of performance in adverse conditions. One approach for addressing these variations is to utilize front-end stress classification to direct a stress dependent recognition algorithm which separately models each speech production domain. This study proposes a new approach which combines stress classification and speech recognition functions into one algorithm. This is accomplished by generalizing the one-dimensional (1-D) hidden Markov model to an N-channel hidden Markov model (N-channel HMM). Here, each stressed speech production style under consideration is allocated a dimension in the N-channel HMM to model each perceptually induced stress condition. It is shown that this formulation better integrates perceptually induced stress effects for stress independent recognition. This is due to the sub-phoneme (state level) stress classification that is implicitly performed by the algorithm. The proposed N-channel stress independent HMM method is compared to a previously established one-channel stress dependent isolated word recognition system yielding a 73.8% reduction in error rate. In addition, an 82.7% reduction in error rate is observed compared to the common one-channel neutral trained recognition approach

Keywords :

hidden Markov models; signal classification; speech recognition; 1D HMM; N-channel HMM; N-channel hidden Markov models; error rate reduction; front-end stress classification; one-channel neutral trained recognition; perceptually induced stress; performance; robust speech recognition systems; speech production domain; speech variations; state level stress classification; stress dependent isolated word recognition system; stress dependent recognition algorithm; stress independent recognition; stressed speech classification; stressed speech production; stressed speech recognition; sub-phoneme stress classification; Aircraft; Error analysis; Hidden Markov models; Laboratories; Natural languages; Robustness; Speech processing; Speech recognition; Stress; Telephony;

fLanguage :

English

Journal_Title :

Speech and Audio Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1063-6676

Type :

jour

DOI :

10.1109/89.799692

Filename :

799692

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1257626