DocumentCode
2692744
Title
The Vera am Mittag German audio-visual emotional speech database
Author
Grimm, Michael ; Kroschel, Kristian ; Narayanan, Shrikanth
Author_Institution
Inst. fur Nachrichtentechnik, Univ. Karlsruhe, Karlsruhe
fYear
2008
fDate
June 23 2008-April 26 2008
Firstpage
865
Lastpage
868
Abstract
The lack of publicly available annotated databases is one of the major barriers to research advances on emotional information processing. In this contribution we present a recently collected database of spontaneous emotional speech in German which is being made available to the research community. The database consists of 12 hours of audio-visual recordings of the German TV talk show ldquoVera am Mittagrdquo, segmented into broadcasts, dialogue acts and utterances. This corpus contains spontaneous and very emotional speech recorded from unscripted, authentic discussions between the guests of the talk show. In addition to the audio-visual data and the segmented utterances we provide emotion labels for a great part of the data. The emotion labels are given on a continuous valued scale for three emotion primitives: valence, activation and dominance, using a large number of human evaluators. Such data is of great interest to all research groups working on spontaneous speech analysis, emotion recognition in both speech and facial expression, natural language understanding, and robust speech recognition.
Keywords
audio databases; audio recording; audio-visual systems; emotion recognition; linguistics; natural language processing; speech recognition; German TV talk show Vera am Mittag; German language; audio-visual recording; emotion recognition; emotional speech database; facial expression; natural language understanding; segmented utterance; speech recognition; spontaneous emotional speech; spontaneous speech analysis; Audio databases; Audio recording; Emotion recognition; Humans; Information processing; Natural languages; Robustness; Speech analysis; Speech recognition; TV broadcasting; Data acquisition; Speech analysis; Speech processing; TV; Video signal processing;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia and Expo, 2008 IEEE International Conference on
Conference_Location
Hannover
Print_ISBN
978-1-4244-2570-9
Electronic_ISBN
978-1-4244-2571-6
Type
conf
DOI
10.1109/ICME.2008.4607572
Filename
4607572
Link To Document