DocumentCode
3716069
Title
Arabic speaker emotion classification using rhythm metrics and neural networks
Author
Ali Mefiah;Yousef A. Alotaibi;Sid-Ahmed Selouani
Author_Institution
College of Computer and Information Sciences, King Saud University, Riyadh, Saudi Arabia
fYear
2015
Firstpage
1426
Lastpage
1430
Abstract
In this paper, rhythm metrics are calculated and used to classify five Arabic speech emotions; namely, neutral, sad, happy, surprised, and angry. Eight speakers (four male and four female) simulated the five emotions in their speech by speaking three selected sentences two times each. A human perception test was conducted using nine listeners (male and female). The results of a neural network-based automatic emotion recognition system using rhythm metrics were similar to the human perception test results, although less accurate. Anger was the most recognized speaker emotion and happiness was the least. One of our findings is that the emotions of male speakers are easier to recognize than those of female speakers. In addition, we found that the neural networks and rhythm metrics can be used for speaker emotion recognition using speech signals, but only when the dataset size is large enough.
Keywords
"Speech","Rhythm","Measurement","Speech processing","Speech recognition","Emotion recognition","Feature extraction"
Publisher
ieee
Conference_Titel
Signal Processing Conference (EUSIPCO), 2015 23rd European
Electronic_ISBN
2076-1465
Type
conf
DOI
10.1109/EUSIPCO.2015.7362619
Filename
7362619
Link To Document