DocumentCode :
3191540
Title :
Classification of Stressed Speech using Gaussian Mixture Model
Author :
Patro ; Raja, G. Senthil ; Dandapat, S.
Author_Institution :
Dept. of ECE, Indian Institute of Technology, Guwahati, India, patro@iitg.ernet.in
fYear :
2005
fDate :
11-13 Dec. 2005
Firstpage :
342
Lastpage :
346
Abstract :
In this work, different speech features, such as Sinusoidal Frequency Features (SFF), Sinusoidal Amplitude Features (SAF), Cepstral Coefficients (CC) and Mel Frequency Cepstral Coefficients (MFCC), are evaluated for their relative effectiveness in representing stressed speech. Different statistical feature evaluation techniques, such as probability density characteristics, the F-ratio test, the Kolmogorov-Smirnov (KS) test and a Vector Quantization (VQ) classifier, are used to assess the performance of the speech features. A novel statistical Feature Discrimination Measure (FDM) is proposed for the same purpose. A Gaussian Mixture Model (GMM) classifier is tested for recognition of different stress levels in a speech signal. The Speech Under Simulated Emotion (SUSE) database is used for stress analysis. SAF gives the best recognition result, followed by SFF, MFCC and CC in that order, with both the GMM and VQ classifiers. FDM values and the KS test suggest similar performance for the speech features. F-ratio values indicate the best performance with SFF, followed by SAF, MFCC and CC in that order.
Keywords :
F-ratio and FDM; Feature evaluation; GMM; Kolmogorov-Smirnov test; Analytical models; Cepstral analysis; Mel frequency cepstral coefficient; Performance evaluation; Probability; Speech analysis; Speech recognition; Stress; Testing; Vector quantization
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
INDICON, 2005 Annual IEEE
Print_ISBN :
0-7803-9503-4
Type :
conf
DOI :
10.1109/INDCON.2005.1590186
Filename :
1590186