Title :
Subband based classification of speech under stress
Author :
Sarikaya, Ruhi ; Gowdy, John N.
Author_Institution :
Digital Speech & Audio Process. Lab., Clemson Univ., SC, USA
Abstract :
This study proposes a new set of feature parameters based on subband analysis of the speech signal for classification of speech under stress. The new speech features are scale energy (SE), autocorrelation-scale-energy (ACSE), subband based cepstral parameters (SC), and autocorrelation-SC (ACSC). The parameters´ ability to capture different stress types is compared to widely used mel-scale cepstrum based representations: mel-frequency cepstral coefficients (MFCC) and autocorrelation-mel-scale (AC-mel). Next, a feedforward neural network is formulated for speaker-dependent stress classification of 10 stress conditions: angry, clear, cond50/70, fast, loud, lombard, neutral, question, slow, and soft. The classification algorithm is evaluated using a previously established stressed speech database (SUSAS) (Hansen and Bou-Ghazale 1997). Subband based features are shown to achieve +7.3% and +9.1% increase in the classification rates over the MFCC based parameters for ungrouped and grouped stress closed vocabulary test scenarios respectively. Moreover the average scores across the simulations of new features are +8.6% and +13.6% higher than MFCC based features for the ungrouped and grouped stress test scenarios respectively
Keywords :
correlation methods; feature extraction; feedforward neural nets; pattern classification; speech recognition; AC-mel; ACSC; ACSE; MFCC; SC; SE; SUSAS; autocorrelation-SC; autocorrelation-mel-scale; autocorrelation-scale-energy; feature parameters; feedforward neural network; grouped stress closed vocabulary test scenario; mel-frequency cepstral coefficients; mel-scale cepstrum based representations; scale energy; speaker-dependent stress classification; speech under stress; stress conditions; stress types; subband analysis; subband based cepstral parameters; subband based classification; ungrouped stress closed vocabulary test scenario; Autocorrelation; Cepstral analysis; Cepstrum; Feedforward neural networks; Mel frequency cepstral coefficient; Neural networks; Signal analysis; Speech analysis; Stress; Testing;
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7803-4428-6
DOI :
10.1109/ICASSP.1998.674494