• DocumentCode
    72114
  • Title

    Classifier-based learning of nonlinear feature manifold for visualization of emotional speech prosody

  • Author

    Vayrynen, E. ; Kortelainen, Jukka ; Seppanen, T.

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Univ. of Oulu, Oulu, Finland
  • Volume
    4
  • Issue
    1
  • fYear
    2013
  • fDate
    Jan.-March 2013
  • Firstpage
    47
  • Lastpage
    56
  • Abstract
    Visualization of emotional speech data is an important tool for speech researchers who seek means to gain a deeper insight into the structure of complex multidimensional data. A visualization method is presented that utilizes feature selection and classifier optimization for learning Isomap manifolds of emotional speech data. The resulting manifold is based on those features that best discriminate between given emotional classes in the target space of specified embedding dimension. A nonlinear mapping function based on generalized regression neural networks (GRNNs) provides generalization for new data. A low-dimensional manifold of emotional speech data consisting of neutral, sad, angry, and happy expressions was constructed using prosodic and acoustic features of speech. Experimental results indicate that a 3D embedding provides the best classification performance. The manifold structure can be readily visualized and matches the circumplex and conical shapes predicted by dimensional models of emotion. Listening tests show excellent correlation between the organization of the data on the manifold and the listeners´ judgment of emotional intensity.
  • Keywords
    data visualisation; generalisation (artificial intelligence); learning (artificial intelligence); neural nets; pattern classification; speech processing; GRNN; Isomap manifold learning; acoustic speech feature; circumplex shape; classification performance; classifier optimization; classifier-based learning; conical shape; data generalization; embedding dimension; emotional class; emotional intensity; emotional speech data visualization; emotional speech prosody visualization; feature selection; generalized regression neural network; listening test; nonlinear feature manifold; nonlinear mapping function; prosodic speech feature; Biological system modeling; Data visualization; Estimation; Machine learning; Manifolds; Principal component analysis; Speech processing; Affect sensing and analysis; Biological system modeling; Data visualization; Estimation; Machine learning; Manifolds; Principal component analysis; Speech processing; feature evaluation and selection; information visualization; machine learning; modeling human emotion; nonlinear manifold learning; speech analysis;
  • fLanguage
    English
  • Journal_Title
    Affective Computing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1949-3045
  • Type

    jour

  • DOI
    10.1109/T-AFFC.2012.35
  • Filename
    6357182