DocumentCode
177681
Title
Linked Source and Target Domain Subspace Feature Transfer Learning -- Exemplified by Speech Emotion Recognition
Author
Jun Deng ; Zixing Zhang ; Schuller, B.
Author_Institution
Machine Intell. & Signal Process. Group, Tech. Univ. Munchen, Munich, Germany
fYear
2014
fDate
24-28 Aug. 2014
Firstpage
761
Lastpage
766
Abstract
The typical inherent mismatch between the test and training corpora and by that between ´target´ and ´source´ sets usually leads to significant performance downgrades. To cope with this, this study presents a feature transfer learning method using Denoising Auto encoders (DAEs) to build high order subspaces of the source and target corpora, where features in the source domain are transferred to the target domain by an additional neural network. To exemplify effectiveness of our approach, we select the INTERSPEECH Emotion Challenge´s FAU Aibo Emotion Corpus as target corpus and further two publicly available databases as source corpora for extensive and reproducible evaluation. The experimental results show that our method significantly improves over the baseline performance and outperforms today´s state-of-the-art domain adaptation methods.
Keywords
emotion recognition; neural nets; speech processing; DAE; INTERSPEECH emotion challenge FAU Aibo emotion corpus; denoising auto encoders; linked source; neural network; source corpora; speech emotion recognition; target corpora; target domain subspace feature transfer learning; Artificial neural networks; Databases; Emotion recognition; Noise reduction; Speech; Training; cross-corpus; denoising autoencoders; domain adaptation; feature transfer learning; speech emotion recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition (ICPR), 2014 22nd International Conference on
Conference_Location
Stockholm
ISSN
1051-4651
Type
conf
DOI
10.1109/ICPR.2014.141
Filename
6976851
Link To Document