• DocumentCode
    153630
  • Title

    Automatic emotion variation detection using multi-scaled sliding window

  • Author

    Yuchao Fan ; Mingxing Xu ; Zhiyong Wu ; Lianhong Cai

  • Author_Institution
    Minist. of Educ. Tsinghua Nat. Lab. for Inf. Sci. & Technol. (TNList) Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
  • fYear
    2014
  • fDate
    20-23 Sept. 2014
  • Firstpage
    232
  • Lastpage
    236
  • Abstract
    Emotion recognition from speech plays an important role in developing affective and intelligent Human Computer Interaction. The goal of this work is to build an Automatic Emotion Variation Detection (AEVD) system to determine each emotional salient segment in continuous speech. We focus on emotion detection in angry-neutral speech, which is common in recent studies of AEVD. This study proposes a novel framework for AEVD using Multi-scaled Sliding Window (MSW-AEVD) to assign an emotion class to each window-shift by fusion decisions of all the sliding windows containing the shift. Firstly, sliding window with fixed-length is introduced as the basic procedure, in which several different fusion methods are investigated. Then multi-scaled sliding window is employed to support multi-classifiers with different timescale features, in which another two fusion strategies are provided. Finally, a postprocessing is applied to refine the final outputs. Performance evaluation is carried out on the public Berlin database EMO-DB. Our experimental results show that proposed MSW-AEVD significantly outperforms the traditional HMM-based AEVD.
  • Keywords
    emotion recognition; human computer interaction; sensor fusion; signal classification; speech recognition; AEVD system; EMO-DB; MSW-AEVD; affective human computer interaction; angry-neutral speech; automatic emotion variation detection; continuous speech; emotion class assignment; emotion detection; emotion recognition; emotional salient segment; fixed-length sliding window; fusion decision; fusion method; fusion strategy; intelligent human computer interaction; multiclassifiers; multiscaled sliding window; performance evaluation; public Berlin database; window-shift; Accuracy; Emotion recognition; Feature extraction; Hidden Markov models; Speech; Speech recognition; Support vector machines; Emotion Detection; Emotion Variation; Multi-Scaled Sliding Window;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Orange Technologies (ICOT), 2014 IEEE International Conference on
  • Conference_Location
    Xian
  • Type

    conf

  • DOI
    10.1109/ICOT.2014.6956642
  • Filename
    6956642