• DocumentCode
    590883
  • Title
    A study of emotional information present in articulatory movements estimated using acoustic-to-articulatory inversion
  • Author
    Kim, Jangwon; Ghosh, Prosenjit; Lee, Sungbok; Narayanan, Shrikanth S.
  • Author_Institution
    Signal Analysis & Interpretation Laboratory (SAIL), University of Southern California, Los Angeles, CA, USA
  • fYear
    2012
  • fDate
    3-6 Dec. 2012
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    This study examines emotion-specific information (ESI) in articulatory movements estimated using acoustic-to-articulatory inversion on emotional speech. We study two main aspects: (1) the degree of similarity between pairs of estimated and original articulatory trajectories for the same and for different emotions, and (2) the amount of ESI present in the estimated trajectory. These are evaluated using the mean squared error between the articulatory pair and by automated emotion classification, respectively. The study uses parallel acoustic and articulatory data in 5 elicited emotions spoken by 3 native American English speakers. We also test emotion classification performance using articulatory trajectories estimated from different acoustic feature sets; the results turn out to be subject-dependent. Experimental results suggest that the ESI in the estimated trajectory, although smaller than that in the direct articulatory measurements, is complementary to that in the prosodic features, which suggests the usefulness of estimated articulatory data for emotions research.
  • Keywords
    emotion recognition; speech; speech processing; American English speakers; acoustic feature sets; acoustic to articulatory inversion; articulatory data; articulatory measurement; articulatory movements; articulatory pair; automated emotion classification; elicited emotion; emotion specific information; emotional information; emotions research; mean squared error; original articulatory trajectory; parallel acoustic; prosodic features; test emotion classification performance; Accuracy; Mel frequency cepstral coefficient; Production; Speech; Support vector machines; Trajectory;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), 2012 Asia-Pacific
  • Conference_Location
    Hollywood, CA
  • Print_ISBN
    978-1-4673-4863-8
  • Type
    conf
  • Filename
    6412030