Title :
Extraction of adaptive wavelet packet filter-bank-based acoustic feature for speech emotion recognition
Author :
Yongming Huang ; Ao Wu ; Guobao Zhang ; Yue Li
Author_Institution :
Sch. of Autom., Southeast Univ., Nanjing, China
Abstract :
In this paper, a wavelet packet (WP)-based acoustic feature extraction approach is proposed for automatic speech emotion recognition (SER). First, the issue of optimising the WP filter-bank structure for a given classification task is presented as a tree-pruning problem, and different tree-pruning criteria are investigated. On this basis, a novel WP-based feature is introduced for SER, namely discriminative band WP power coefficients. Finally, an SER system is built and extensive experiments are carried out. Experimental results show that the proposed feature considerably improves emotion recognition performance over the conventional mel-frequency cepstral coefficient (MFCC) feature. The proposed feature extraction approach is promising because it can be easily extended to two-dimensional (2D) facial expression analysis with 2D WP quadtree structures, making a high-quality audio-visual bimodal emotion recognition system a desirable further goal.
Keywords :
adaptive filters; channel bank filters; emotion recognition; feature extraction; quadtrees; speech recognition; 2D WP quadtree structures; 2D facial expression analysis; WP-based acoustic feature extraction approach; adaptive wavelet packet filter-bank-based acoustic feature extraction; automatic SER; automatic speech emotion recognition; classification task; conventional MFCC feature; discriminative band WP power coefficients; high-quality audio-visual bimodal emotion recognition system; tree pruning problem; tree-pruning criteria; two-dimensional facial expression analysis;
Journal_Title :
Signal Processing, IET
DOI :
10.1049/iet-spr.2013.0446