DocumentCode :
3716069
Title :
Arabic speaker emotion classification using rhythm metrics and neural networks
Author :
Ali Mefiah;Yousef A. Alotaibi;Sid-Ahmed Selouani
Author_Institution :
College of Computer and Information Sciences, King Saud University, Riyadh, Saudi Arabia
fYear :
2015
Firstpage :
1426
Lastpage :
1430
Abstract :
In this paper, rhythm metrics are calculated and used to classify five Arabic speech emotions; namely, neutral, sad, happy, surprised, and angry. Eight speakers (four male and four female) simulated the five emotions in their speech by speaking three selected sentences two times each. A human perception test was conducted using nine listeners (male and female). The results of a neural network-based automatic emotion recognition system using rhythm metrics were similar to the human perception test results, although less accurate. Anger was the most recognized speaker emotion and happiness was the least. One of our findings is that the emotions of male speakers are easier to recognize than those of female speakers. In addition, we found that the neural networks and rhythm metrics can be used for speaker emotion recognition using speech signals, but only when the dataset size is large enough.
Keywords :
"Speech","Rhythm","Measurement","Speech processing","Speech recognition","Emotion recognition","Feature extraction"
Publisher :
ieee
Conference_Titel :
Signal Processing Conference (EUSIPCO), 2015 23rd European
Electronic_ISBN :
2076-1465
Type :
conf
DOI :
10.1109/EUSIPCO.2015.7362619
Filename :
7362619
Link To Document :
بازگشت