• DocumentCode
    2406564
  • Title

    Emotions in Hindi speech- analysis, perception and recognition

  • Author

    Agrawal, S.S.

  • Author_Institution
    Coll. of Eng., KIIT, Gurgaon, India
  • fYear
    2011
  • fDate
    26-28 Oct. 2011
  • Firstpage
    7
  • Lastpage
    13
  • Abstract
    Human Speech conveys speaker´s emotional state along with linguistic intelligence. Meaning of a speech sample changes when it is uttered with different emotions. The present paper gives a description of different types of studies conducted to analyze, perceive and recognize commonly occurring emotions in Hindi speech. These have been classified as anger, happiness, fear, sadness, surprise in addition to neutral. Intonation, intensity and duration patterns changes due to changes in sentence types as well as due to changes in emotions. A relationship among the measured acoustic parameters and the patterns has been used to classify them. Experiments have been conducted to study and recognise emotions based on phonetic as well as prosodic parameters in the speech samples due to changes in emotions. These parameters include MFCC & their derivatives and prosodic parameters as the F0, A0 and Duration. In one of the experiment vowel segments taken from continuously spoken sentences and in another experiment Hindi digits were used as speech samples for machine recognition of emotions using the Neural Net classifiers. Human perception experiments have been conducted at all levels of experiments and compared the results with machine recognition performance. In most cases it has been found that machine recognition was found to be better compared to human performance. Both Phonetic as well as prosodic parameters play role in identification of emotions.
  • Keywords
    acoustic measurement; emotion recognition; linguistics; natural language processing; neural nets; signal classification; speech recognition; Hindi speech; MFCC; acoustic parameter measurement; emotion machine recognition; human speech; linguistic intelligence; neural net classifiers; phonetic parameter; prosodic parameter; speaker emotional state; vowel segments; Biological neural networks; Databases; Emotion recognition; Humans; Spectrogram; Speech; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Speech Database and Assessments (Oriental COCOSDA), 2011 International Conference on
  • Conference_Location
    Hsinchu
  • Print_ISBN
    978-1-4577-0930-2
  • Type

    conf

  • DOI
    10.1109/ICSDA.2011.6085972
  • Filename
    6085972