DocumentCode
1691207
Title
Automatic detection of anger in telephone speech with robust autoregressive modulation filtering
Author
Pohjalainen, Jouni ; Alku, Paavo
Author_Institution
Dept. of Signal Process. & Acoust., Aalto Univ., Espoo, Finland
fYear
2013
Firstpage
7537
Lastpage
7541
Abstract
A new system for automatic detection of angry speech is proposed. Using simulation of far-end-noise-corrupted telephone speech and the widely used Berlin database of emotional speech, autoregressive prediction of features across speech frames is shown to contribute significantly to both the clean speech performance and the robustness of the system. The autoregressive models are learned from the training data in order to capture long-term temporal dynamics of the features. Additionally, linear predictive spectrum analysis outperforms conventional Fourier spectrum analysis in terms of robustness in the computation of mel-frequency cepstral coefficients in the feature extraction stage.
Keywords
autoregressive processes; emotion recognition; filtering theory; speech recognition; telephony; Berlin database; anger detection; angry speech; automatic detection; emotional speech; far-end-noise-corrupted telephone speech; feature extraction; linear predictive spectrum analysis; mel-frequency cepstral coefficient; robust autoregressive modulation filtering; temporal dynamics; Databases; Emotion recognition; Feature extraction; Noise; Robustness; Speech; Speech recognition; emotion detection; speech analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location
Vancouver, BC
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2013.6639128
Filename
6639128
Link To Document