مرکز منطقه ای اطلاع رساني علوم و فناوري - A personalized emotion recognition system using an unsupervised feature adaptation scheme

DocumentCode :

3167329

Title :

A personalized emotion recognition system using an unsupervised feature adaptation scheme

Author :

Rahman, Tauhidur ; Busso, Carlos

Author_Institution :

Dept. of Electr. Eng., Univ. of Texas at Dallas, Dallas, TX, USA

fYear :

2012

fDate :

25-30 March 2012

Firstpage :

5117

Lastpage :

5120

Abstract :

A personalized emotion recognition system aims to tune the model to recognize the expressive behaviors of a targeted person. Such a system can play an important role in various domains including call center and health care applications. Adapting any general emotion recognition system for a particular individual requires speech samples and prior knowledge about their emotional content. These assumptions constrain the use of these techniques in many real scenarios in which no annotated data is available to train or adapt the models. To address this problem, this paper introduces an unsupervised feature adaptation scheme that aims to reduce the mismatch between the acoustic features used to train the system and the acoustic features extracted from the unknown targeted speaker. The adaptation scheme uses our recently proposed iterative feature normalization (IFN) framework. An emotion detection system is trained with the IEMOCAP database. For testing, a database was created by downloading videos from a video-sharing website, containing various interviews from a targeted subject (1.5 hours). The detection system is used to identify emotional speech with and without the proposed feature adaptation scheme. The experimental results indicate that the proposed approach improves the unweighted accuracy from 50.8% to 70.0%.

Keywords :

emotion recognition; feature extraction; iterative methods; speech recognition; unsupervised learning; IEMOCAP database; IFN framework; acoustic feature extraction; acoustic features; emotional speech detection system; iterative feature normalization framework; personalized emotion recognition system; speech samples; unsupervised feature adaptation scheme; video-sharing Website; Accuracy; Acoustics; Databases; Emotion recognition; Feature extraction; Speech; Testing; Personalized emotion recognition; feature adaptation; front-end feature normalization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on

Conference_Location :

Kyoto

ISSN :

1520-6149

Print_ISBN :

978-1-4673-0045-2

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2012.6289072

Filename :

6289072

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3167329