Regional Information Center for Science and Technology - Cepstral domain talker stress compensation for robust speech recognition

DocumentCode :

750006

Title :

Cepstral domain talker stress compensation for robust speech recognition

Author :

Chen, Yeunung

Author_Institution :

MIT Lincoln Lab., Lexington, MA, USA

Volume :

Issue :

fYear :

1988

fDate :

4/1/1988

Firstpage :

433

Lastpage :

439

Abstract :

A study of talker-stress-induced intraword variability and an algorithm that compensates for the systematic changes observed are presented. The study is based on hidden Markov models trained on speech tokens spoken in various talking styles. The talking styles include normal speech, fast speech, loud speech, soft speech, and talking with noise injected through earphones; the styles are designed to simulate speech produced under real stressful conditions. Cepstral coefficients are used as the parameters in the hidden Markov models. The stress compensation algorithm compensates for the variations in the cepstral coefficients in a hypothesis-driven manner. The functional form of the compensation is shown to correspond to the equalization of spectral tilts. Substantial reduction of error rates has been achieved when the cepstral domain compensation techniques were tested on the simulated-stress speech database. The hypothesis-driven compensation technique reduced the average error rate from 13.9% to 6.2%. When a more sophisticated recognizer was used, it reduced the error rate from 2.5% to 1.9%.
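
The link the abstract draws between a cepstral-domain correction and spectral-tilt equalization can be illustrated with a minimal sketch. It is not the paper's algorithm: the coefficient values, the `loud_offset` vector, and the function names are hypothetical, and the hypothesis-driven selection of the offset is not modeled. It relies only on the standard cosine-series relation between real cepstral coefficients and the log-magnitude spectrum, under which an additive offset on the first cepstral coefficient appears as a tilt across frequency.

```python
import math


def log_spectrum(cepstrum, omegas):
    """Log-magnitude spectrum implied by real cepstral coefficients.

    Standard relation: log|S(w)| = c[0] + 2 * sum_{n>=1} c[n] * cos(n * w).
    """
    return [
        cepstrum[0]
        + 2.0 * sum(c * math.cos(n * w) for n, c in enumerate(cepstrum) if n >= 1)
        for w in omegas
    ]


def compensate(cepstrum, style_offset):
    """Subtract an assumed per-style cepstral offset (hypothetical helper)."""
    return [c - d for c, d in zip(cepstrum, style_offset)]


# Frequencies from 0 to pi (half the sampling rate), 9 points.
omegas = [math.pi * k / 8 for k in range(9)]

# Made-up cepstral vector for a "normal" token, for illustration only.
normal = [1.0, 0.50, -0.20, 0.10]

# Hypothetical systematic shift for a "loud" style, acting on c[1] only.
loud_offset = [0.0, -0.30, 0.0, 0.0]
loud = [c + d for c, d in zip(normal, loud_offset)]

# Removing the offset in the cepstral domain recovers the normal-style vector.
restored = compensate(loud, loud_offset)

# The same offset seen in the log-spectral domain: -0.6 * cos(w), which
# rises steadily from low to high frequency, i.e. a change in spectral tilt.
tilt = [a - b for a, b in zip(log_spectrum(loud, omegas), log_spectrum(normal, omegas))]
```

In this toy setting, subtracting the offset from the cepstral vector is equivalent to flattening the tilt difference in the log spectrum, which is the sense in which a cepstral compensation can be read as tilt equalization.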

Keywords :

Markov processes; speech recognition; cepstral coefficients; cepstral domain compensation; earphones; error rates; fast speech; hidden Markov models; hypothesis-driven compensation; loud speech; noise; normal speech; soft speech; speech recognition; speech tokens; stress compensation algorithm; talker stress compensation; talking styles; Cepstral analysis; Databases; Degradation; Error analysis; Hidden Markov models; Human factors; Robustness; Speech enhancement; Speech recognition; Stress;

fLanguage :

English

Journal_Title :

Acoustics, Speech and Signal Processing, IEEE Transactions on

Publisher :

IEEE

ISSN :

0096-3518

Type :

jour

DOI :

10.1109/29.1547

Filename :

1547

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=750006