• DocumentCode
    3716069
  • Title
    Arabic speaker emotion classification using rhythm metrics and neural networks
  • Author
    Ali Meftah;Yousef A. Alotaibi;Sid-Ahmed Selouani
  • Author_Institution
    College of Computer and Information Sciences, King Saud University, Riyadh, Saudi Arabia
  • fYear
    2015
  • Firstpage
    1426
  • Lastpage
    1430
  • Abstract
    In this paper, rhythm metrics are calculated and used to classify five emotions in Arabic speech: neutral, sad, happy, surprised, and angry. Eight speakers (four male and four female) simulated the five emotions by speaking three selected sentences twice each. A human perception test was conducted with nine listeners (male and female). The neural network-based automatic emotion recognition system using rhythm metrics produced results similar to those of the human perception test, although less accurate. Anger was the most recognized speaker emotion and happiness the least. We also found that the emotions of male speakers are easier to recognize than those of female speakers, and that neural networks and rhythm metrics can be used for speaker emotion recognition from speech signals, but only when the dataset is large enough.
  • Keywords
    "Speech","Rhythm","Measurement","Speech processing","Speech recognition","Emotion recognition","Feature extraction"
  • Publisher
    IEEE
  • Conference_Title
    Signal Processing Conference (EUSIPCO), 2015 23rd European
  • Electronic_ISBN
    2076-1465
  • Type
    conf
  • DOI
    10.1109/EUSIPCO.2015.7362619
  • Filename
    7362619