Title of article
Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions
Author/Authors
Bashirpour، M نويسنده , , Geravanchizadeh، M نويسنده ,
Issue Information
فصلنامه با شماره پیاپی سال 2016
Pages
9
From page
197
To page
205
Abstract
Abstract: Automatic recognition of speech emotional states in noisy conditions has
become an important research topic in the emotional speech recognition area, in recent
years. This paper considers the recognition of emotional states via speech in real
environments. For this task, we employ the power normalized cepstral coefficients (PNCC)
in a speech emotion recognition system. We investigate its performance in emotion
recognition using clean and noisy speech materials and compare it with the performances of
the well-known MFCC, LPCC, RASTA-PLP, and also TEMFCC features. Speech samples
are extracted from the Berlin emotional speech database (Emo DB) and Persian emotional
speech database (Persian ESD) which are corrupted with 4 different noise types under
various SNR levels. The experiments are conducted in clean train/noisy test scenarios to
simulate practical conditions with noise sources. Simulation results show that higher
recognition rates are achieved for PNCC as compared with the conventional features under
noisy conditions.
Keywords
Speech Emotion Recognition , Power Normalized Cepstral Coefficients (PNCC) , Noisy Acoustic Condition , Noise Robust Auditory Feature
Journal title
Iranian Journal of Electrical and Electronic Engineering(IJEEE)
Serial Year
2016
Journal title
Iranian Journal of Electrical and Electronic Engineering(IJEEE)
Record number
2402602
Link To Document