DocumentCode :
252473
Title :
A feature extraction scheme based on enhanced wavelet coefficients for Speech Emotion Recognition
Author :
Shahnaz, Celia ; Sultana, Shabana
Author_Institution :
Dept. of Electr. & Electron. Eng., Bangladesh Univ. of Eng. & Technol., Dhaka, Bangladesh
fYear :
2014
fDate :
3-6 Aug. 2014
Firstpage :
1093
Lastpage :
1096
Abstract :
This paper proposes a new feature extraction scheme for speaker-independent Speech Emotion Recognition under both unsupervised and supervised conditions following a non-hierarchical process first and then a hierarchical one. The feature is derived from the Teager energy operated wavelet coefficients of speech signal. The wavelet coefficients enhanced by TE operation is used to compute entropy thus forming a feature vector. The feature vector is fed to unsupervised K-means clustering in a nonhierarchical process. Considering the effectiveness of supervised classification in a recognition problem, the feature is then used in a supervised KNN classifier. It is seen that supervised KNN classifier is more capable of distinguishing different emotions when a hierarchical approach is followed for recognition instead of a non-hierarchical one. Detail simulations are carried on EMO-DB German speech emotion database containing four class emotions, such as angry, happy, sad and neutral. Simulation results show that the proposed feature with supervised hierarchical classification approach provides the higher accuracy in comparison to the other methods of four-class emotion recognition.
Keywords :
emotion recognition; feature extraction; learning (artificial intelligence); pattern clustering; signal classification; speaker recognition; wavelet transforms; EMO-DB German speech emotion database; TE operation; Teager energy; angry; entropy; feature extraction scheme; feature vector; four-class emotion recognition; happy; neutral; nonhierarchical process; recognition problem; sad; speaker-independent speech emotion recognition; speech signal; supervised KNN classifier; supervised conditions; supervised hierarchical classification; unsupervised K-means clustering; wavelet coefficients; Accuracy; Approximation methods; Discrete wavelet transforms; Emotion recognition; Entropy; Speech; Speech recognition; Entropy; Hierarchical; K-means; KNN; Speaker-independent; Teager Energy; Wavelet;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Circuits and Systems (MWSCAS), 2014 IEEE 57th International Midwest Symposium on
Conference_Location :
College Station, TX
ISSN :
1548-3746
Print_ISBN :
978-1-4799-4134-6
Type :
conf
DOI :
10.1109/MWSCAS.2014.6908609
Filename :
6908609
Link To Document :
بازگشت