DocumentCode :
240353
Title :
Speech emotion identification analysis based on different spectral feature extraction methods
Author :
Kamaruddin, Norhaslinda ; Abdul Rahman, Abdul Wahab ; Abdullah, Nor Sakinah
Author_Institution :
Fac. of Comput. & Math. Sci., MARA Univ. of Technol., Jasin, Malaysia
fYear :
2014
fDate :
17-18 Nov. 2014
Firstpage :
1
Lastpage :
5
Abstract :
Human speech conveys the semantic content of the uttered words as well as the underlying emotional state of the interlocutor. Emotion identification is important because it can add features to many applications that improve human-computer interaction. Such improvements can help retain customer satisfaction and loyalty in the long run and serve to attract new customers. Although researchers have used many approaches to recognize emotion from speech, none can claim that their findings are superior, because different feature extraction methods coupled with various classifiers may perform differently depending on the data used. This paper presents a comparative analysis of a speech emotion identification system using two feature extraction methods, Mel Frequency Cepstral Coefficient (MFCC) and Linear Prediction Coefficient (LPC), coupled with a Multilayer Perceptron (MLP) classifier. For further exploration, different numbers of MFCC filters are employed to observe the performance of the proposed system. The results indicate that MFCC-40 performs slightly better than the other MFCC configurations on Berlin EMO-DB and NTU_American, whereas MFCC-20 performs well on NTU_Asian. MFCC also consistently performed better than LPC in all experiments, which is in line with many reported findings. This understanding can be extended to further study of speech emotion in order to develop more robust, lower-error systems in the future.
Keywords :
emotion recognition; feature extraction; speech processing; Berlin EMO-DB; LPC; MFCC coefficients; MFCC filters; MFCC-40; MLP classifier; customer satisfaction; emotion information; human computer interaction aspect; human speech communication; interlocutor; least error system; linear prediction coefficient; mel frequency cepstral coefficient; multilayer perceptron; semantic information; spectral feature extraction methods; speech emotion identification analysis; speech emotion identification system; uttered word; Accuracy; Bandwidth; Feature extraction; Filter banks; Mel frequency cepstral coefficient; Speech; Speech recognition; Cultural influence on speech emotion; Linear Prediction Coefficient (LPC); Mel Frequency Cepstral Coefficient (MFCC); Speech emotion; Speech emotion identification;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Information and Communication Technology for The Muslim World (ICT4M), 2014 The 5th International Conference on
Conference_Location :
Kuching
Type :
conf
DOI :
10.1109/ICT4M.2014.7020588
Filename :
7020588