Title :
Emotion recognition from spontaneous speech using Hidden Markov models with deep belief networks
Author :
Le, Dat ; Provost, Emily Mower
Author_Institution :
Comput. Sci. & Eng., Univ. of Michigan, Ann Arbor, MI, USA
Abstract :
Research in emotion recognition seeks to develop insights into the temporal properties of emotion. However, automatic emotion recognition from spontaneous speech is challenging due to non-ideal recording conditions and highly ambiguous ground truth labels. Further, emotion recognition systems typically work with noisy high-dimensional data, rendering it difficult to find representative features and train an effective classifier. We tackle this problem by using Deep Belief Networks, which can model complex and non-linear high-level relationships between low-level features. We propose and evaluate a suite of hybrid classifiers based on Hidden Markov Models and Deep Belief Networks. We achieve state-of-the-art results on FAU Aibo, a benchmark dataset in emotion recognition [1]. Our work provides insights into important similarities and differences between speech and emotion.
Keywords :
belief networks; emotion recognition; hidden Markov models; speech recognition; deep belief networks; emotion recognition systems; hidden Markov models; hybrid classifiers; low-level features; noisy high-dimensional data; non ideal recording conditions; spontaneous speech; temporal properties; Computer architecture; Context; Emotion recognition; Hidden Markov models; Speech; Speech recognition; Training; FAU Aibo; deep belief networks; dynamic modeling; emotion classification; spontaneous speech;
Conference_Titel :
Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on
Conference_Location :
Olomouc
DOI :
10.1109/ASRU.2013.6707732