• DocumentCode
    252473
  • Title

    A feature extraction scheme based on enhanced wavelet coefficients for Speech Emotion Recognition

  • Author

    Shahnaz, Celia ; Sultana, Shabana

  • Author_Institution
    Dept. of Electr. & Electron. Eng., Bangladesh Univ. of Eng. & Technol., Dhaka, Bangladesh
  • fYear
    2014
  • fDate
    3-6 Aug. 2014
  • Firstpage
    1093
  • Lastpage
    1096
  • Abstract
    This paper proposes a new feature extraction scheme for speaker-independent Speech Emotion Recognition under both unsupervised and supervised conditions following a non-hierarchical process first and then a hierarchical one. The feature is derived from the Teager energy operated wavelet coefficients of speech signal. The wavelet coefficients enhanced by TE operation is used to compute entropy thus forming a feature vector. The feature vector is fed to unsupervised K-means clustering in a nonhierarchical process. Considering the effectiveness of supervised classification in a recognition problem, the feature is then used in a supervised KNN classifier. It is seen that supervised KNN classifier is more capable of distinguishing different emotions when a hierarchical approach is followed for recognition instead of a non-hierarchical one. Detail simulations are carried on EMO-DB German speech emotion database containing four class emotions, such as angry, happy, sad and neutral. Simulation results show that the proposed feature with supervised hierarchical classification approach provides the higher accuracy in comparison to the other methods of four-class emotion recognition.
  • Keywords
    emotion recognition; feature extraction; learning (artificial intelligence); pattern clustering; signal classification; speaker recognition; wavelet transforms; EMO-DB German speech emotion database; TE operation; Teager energy; angry; entropy; feature extraction scheme; feature vector; four-class emotion recognition; happy; neutral; nonhierarchical process; recognition problem; sad; speaker-independent speech emotion recognition; speech signal; supervised KNN classifier; supervised conditions; supervised hierarchical classification; unsupervised K-means clustering; wavelet coefficients; Accuracy; Approximation methods; Discrete wavelet transforms; Emotion recognition; Entropy; Speech; Speech recognition; Entropy; Hierarchical; K-means; KNN; Speaker-independent; Teager Energy; Wavelet;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Circuits and Systems (MWSCAS), 2014 IEEE 57th International Midwest Symposium on
  • Conference_Location
    College Station, TX
  • ISSN
    1548-3746
  • Print_ISBN
    978-1-4799-4134-6
  • Type

    conf

  • DOI
    10.1109/MWSCAS.2014.6908609
  • Filename
    6908609