مرکز منطقه ای اطلاع رساني علوم و فناوري - A feature extraction scheme based on enhanced wavelet coefficients for Speech Emotion Recognition

DocumentCode :

252473

Title :

A feature extraction scheme based on enhanced wavelet coefficients for Speech Emotion Recognition

Author :

Shahnaz, Celia ; Sultana, Shabana

Author_Institution :

Dept. of Electr. & Electron. Eng., Bangladesh Univ. of Eng. & Technol., Dhaka, Bangladesh

fYear :

2014

fDate :

3-6 Aug. 2014

Firstpage :

1093

Lastpage :

1096

Abstract :

This paper proposes a new feature extraction scheme for speaker-independent Speech Emotion Recognition under both unsupervised and supervised conditions following a non-hierarchical process first and then a hierarchical one. The feature is derived from the Teager energy operated wavelet coefficients of speech signal. The wavelet coefficients enhanced by TE operation is used to compute entropy thus forming a feature vector. The feature vector is fed to unsupervised K-means clustering in a nonhierarchical process. Considering the effectiveness of supervised classification in a recognition problem, the feature is then used in a supervised KNN classifier. It is seen that supervised KNN classifier is more capable of distinguishing different emotions when a hierarchical approach is followed for recognition instead of a non-hierarchical one. Detail simulations are carried on EMO-DB German speech emotion database containing four class emotions, such as angry, happy, sad and neutral. Simulation results show that the proposed feature with supervised hierarchical classification approach provides the higher accuracy in comparison to the other methods of four-class emotion recognition.

Keywords :

emotion recognition; feature extraction; learning (artificial intelligence); pattern clustering; signal classification; speaker recognition; wavelet transforms; EMO-DB German speech emotion database; TE operation; Teager energy; angry; entropy; feature extraction scheme; feature vector; four-class emotion recognition; happy; neutral; nonhierarchical process; recognition problem; sad; speaker-independent speech emotion recognition; speech signal; supervised KNN classifier; supervised conditions; supervised hierarchical classification; unsupervised K-means clustering; wavelet coefficients; Accuracy; Approximation methods; Discrete wavelet transforms; Emotion recognition; Entropy; Speech; Speech recognition; Entropy; Hierarchical; K-means; KNN; Speaker-independent; Teager Energy; Wavelet;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Circuits and Systems (MWSCAS), 2014 IEEE 57th International Midwest Symposium on

Conference_Location :

College Station, TX

ISSN :

1548-3746

Print_ISBN :

978-1-4799-4134-6

Type :

conf

DOI :

10.1109/MWSCAS.2014.6908609

Filename :

6908609

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=252473