DocumentCode :
251374
Title :
Significance of acoustic features for designing an emotion classification system
Author :
Kumar, Sudhakar ; Das, Tushar Kanti ; Laskar, Rabul Hussain
Author_Institution :
Sch. of Electron. & Commun. Eng., Shri Mata Vaishno Devi Univ., Jammu, India
fYear :
2014
fDate :
20-22 Dec. 2014
Firstpage :
128
Lastpage :
131
Abstract :
This paper reports some of the observations carried out on SUSE database for emotion classification. A comparative study is made to evaluate the performance of Linear Prediction Cepstral Coefficients (LPCCs) and Mel Frequency Cepstral Coefficients (MFCCs) for designing the emotion classification system for word level utterances. The significance of the orders of the coefficients has been carried out during this study. The results obtained using 12th order LPCC and 13th order MFCC are compared with respect to their reduced dimensions of lower orders. A new classification system based on the feature extraction technique using the 2nd, 3rd and 4th order coefficients of both MFCCs and LPCCs is also proposed. This paper compares the accuracy level of both MFCC and LPCC, enabling us to decide which orders of the parameters (both MFCC and LPCC) are more efficient in conveying the emotion for word level utterances. The initial experiments performed at word level utterances reveal that LPCC is more efficient in detecting emotion as compared to MFCC. Further, we noticed that the emotions conveyed in word level utterances are detected more accurately than that in sentence level utterances. The result suggests that word level approach provides better performance for emotion classification as compared to sentence level approach if the system is designed using vocal tract information only.
Keywords :
cepstral analysis; emotion recognition; feature extraction; human computer interaction; speech processing; LPCC; MFCC; SUSE database; acoustic features; emotion classification system; emotion detection; feature extraction technique; linear prediction cepstral coefficients; mel frequency cepstral coefficients; sentence level utterances; vocal tract information; word level utterances; Accuracy; Databases; Emotion recognition; Feature extraction; Mel frequency cepstral coefficient; Speech; Classification; Emotion; LP Residual; LPCC; MFCC; Mean Value Distance; Normalized Mean; Vocal tract Characteristics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electrical and Computer Engineering (ICECE), 2014 International Conference on
Conference_Location :
Dhaka
Print_ISBN :
978-1-4799-4167-4
Type :
conf
DOI :
10.1109/ICECE.2014.7026962
Filename :
7026962
Link To Document :
بازگشت