Title :
Angry emotion detection from real-life conversational speech by leveraging content structure
Author :
Kim, Wooil ; Hansen, John H L
Author_Institution :
Center for Robust Speech Syst. (CRSS), Univ. of Texas at Dallas, Richardson, TX, USA
Abstract :
This study proposes an effective angry speech detection approach by leveraging content structure within the input speech. A classifier based on an “emotional” language model score is formulated and combined with acoustic feature based classifiers including TEO-based feature and conventional Mel frequency cepstral coefficients (MFCC). The proposed detection algorithm is evaluated on real-life conversational speech which was recorded between customers and call center operators over a telephone network. Analysis on the conversational speech corpus presents a distinctive property between neutral and angry speech in word distribution and frequently occurring words. An improvement of up to 6.23% in Equal Error Rate (EER) is obtained by combining the TEO-based and MFCC features, and emotional language model score based classifiers.
Keywords :
emotion recognition; natural languages; speech; angry emotion detection; conventional Mel frequency cepstral coefficients; emotional language model; equal error rate; leveraging content structure; real-life conversational speech; telephone network; Acoustic signal detection; Cepstral analysis; Mel frequency cepstral coefficient; Natural languages; Robustness; Speech analysis; Speech enhancement; Speech processing; Stress; Telephony; TEO-based feature; angry speech detection; classifier combination; content structure; emotional language model;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495021