The Vera am Mittag German audio-visual emotional speech database

Author

Grimm, Michael ; Kroschel, Kristian ; Narayanan, Shrikanth

Author_Institution

Inst. fur Nachrichtentechnik, Univ. Karlsruhe, Karlsruhe

fYear

2008

fDate

June 23 2008-April 26 2008

Firstpage

865

Lastpage

868

Abstract

The lack of publicly available annotated databases is one of the major barriers to research advances on emotional information processing. In this contribution we present a recently collected database of spontaneous emotional speech in German which is being made available to the research community. The database consists of 12 hours of audio-visual recordings of the German TV talk show ldquoVera am Mittagrdquo, segmented into broadcasts, dialogue acts and utterances. This corpus contains spontaneous and very emotional speech recorded from unscripted, authentic discussions between the guests of the talk show. In addition to the audio-visual data and the segmented utterances we provide emotion labels for a great part of the data. The emotion labels are given on a continuous valued scale for three emotion primitives: valence, activation and dominance, using a large number of human evaluators. Such data is of great interest to all research groups working on spontaneous speech analysis, emotion recognition in both speech and facial expression, natural language understanding, and robust speech recognition.

Keywords

audio databases; audio recording; audio-visual systems; emotion recognition; linguistics; natural language processing; speech recognition; German TV talk show Vera am Mittag; German language; audio-visual recording; emotion recognition; emotional speech database; facial expression; natural language understanding; segmented utterance; speech recognition; spontaneous emotional speech; spontaneous speech analysis; Audio databases; Audio recording; Emotion recognition; Humans; Information processing; Natural languages; Robustness; Speech analysis; Speech recognition; TV broadcasting; Data acquisition; Speech analysis; Speech processing; TV; Video signal processing;

fLanguage

English

Publisher

ieee

Conference_Titel

Multimedia and Expo, 2008 IEEE International Conference on

Conference_Location

Hannover

Print_ISBN

978-1-4244-2570-9

Electronic_ISBN

978-1-4244-2571-6

Type

conf

DOI

10.1109/ICME.2008.4607572

Filename

4607572