• DocumentCode
    2555461
  • Title
    A novel objective function to optimize neural networks for emotion recognition from speech patterns
  • Author
    Huang, Kuan-Chieh ; Kuo, Yau-Hwang
  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
  • fYear
    2010
  • fDate
    15-17 Dec. 2010
  • Firstpage
    413
  • Lastpage
    417
  • Abstract
    Expressions of different emotions usually overlap and are hard to distinguish. Moreover, the numbers of feature patterns are usually imbalanced among the overlapping emotional expressions, and most conventional classifiers tend to favor larger classes in order to achieve a better overall recognition rate. This drawback is also encountered in the multilayer perceptron (MLP) models frequently proposed for emotion recognition because of their superior classification capability and performance. However, MLPs and most recognition techniques refer only to the mean square error and the overall error rate. Furthermore, using an MLP has another disadvantage: a suitable network structure must be searched for. In this paper, a novel objective function for optimizing MLP neural networks is proposed to solve these problems. This function considers the criteria of mean square error, classification error rate, and the distances between the examples and the classification boundary in order to optimize the network parameters and prune the links between neurons. In addition, sigmoid and Gaussian transfer functions are adopted in our method to construct suitable classification boundaries. An artificial data set and the Danish Emotional Speech database are used to verify the MLP-based classifier with the novel objective function. The experimental results show that the proposed model performs better than conventional MLPs.
  • Keywords
    emotion recognition; human computer interaction; multilayer perceptrons; Danish emotional speech database; Gaussian transfer function; MLP neural network; artificial data set; classification boundary; classification error rate; mean square error; multilayer perceptron model; neurons; objective function; speech pattern; Art; Noise measurement; Training; affective human-computer interaction; genetic algorithms; neural networks
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Nature and Biologically Inspired Computing (NaBIC), 2010 Second World Congress on
  • Conference_Location
    Fukuoka
  • Print_ISBN
    978-1-4244-7377-9
  • Type
    conf
  • DOI
    10.1109/NABIC.2010.5716361
  • Filename
    5716361