• DocumentCode
    3641181
  • Title

    A novel perceptual feature set for audio emotion recognition

  • Author

    Mehmet Cenk Sezgin;Bilge Günsel;Güneş Karabulut Kurt

  • Author_Institution
    Department of Electronics and Communications Engineering, İ
  • fYear
    2011
  • fDate
    3/1/2011 12:00:00 AM
  • Firstpage
    780
  • Lastpage
    785
  • Abstract
    We present a novel system for audio emotion recognition based on the Perceptual Evaluation of Audio Quality (PEAQ) model as described by the standard, ITU-R BS.1387-1 which provides a mathematical model resembling the human auditory system. The introduced feature set performs perceptual analysis in time, spectral and Bark domains thus enabling us to represent the statistics of emotional audio for arousal and valence modes with a small number of features. Unlike the existing systems, the proposed feature set learns statistical characteristic of emotional differences hence does not require data normalization to eliminate speaker or corpus dependency. Recognition performance obtained for the well known VAM and EMO-DB corpora show that the classification accuracy achieved by the proposed feature set outperforms the reported benchmarking results particularly for valence both for natural and acted emotional data.
  • Keywords
    "Psychoacoustic models","Modulation","Feature extraction","Emotion recognition","Training","Bandwidth","Ear"
  • Publisher
    ieee
  • Conference_Titel
    Automatic Face & Gesture Recognition and Workshops (FG 2011), 2011 IEEE International Conference on
  • Print_ISBN
    978-1-4244-9140-7
  • Type

    conf

  • DOI
    10.1109/FG.2011.5771348
  • Filename
    5771348