• DocumentCode
    2692744
  • Title

    The Vera am Mittag German audio-visual emotional speech database

  • Author

    Grimm, Michael ; Kroschel, Kristian ; Narayanan, Shrikanth

  • Author_Institution
    Inst. fur Nachrichtentechnik, Univ. Karlsruhe, Karlsruhe
  • fYear
    2008
  • fDate
    June 23 2008-April 26 2008
  • Firstpage
    865
  • Lastpage
    868
  • Abstract
    The lack of publicly available annotated databases is one of the major barriers to research advances on emotional information processing. In this contribution we present a recently collected database of spontaneous emotional speech in German which is being made available to the research community. The database consists of 12 hours of audio-visual recordings of the German TV talk show ldquoVera am Mittagrdquo, segmented into broadcasts, dialogue acts and utterances. This corpus contains spontaneous and very emotional speech recorded from unscripted, authentic discussions between the guests of the talk show. In addition to the audio-visual data and the segmented utterances we provide emotion labels for a great part of the data. The emotion labels are given on a continuous valued scale for three emotion primitives: valence, activation and dominance, using a large number of human evaluators. Such data is of great interest to all research groups working on spontaneous speech analysis, emotion recognition in both speech and facial expression, natural language understanding, and robust speech recognition.
  • Keywords
    audio databases; audio recording; audio-visual systems; emotion recognition; linguistics; natural language processing; speech recognition; German TV talk show Vera am Mittag; German language; audio-visual recording; emotion recognition; emotional speech database; facial expression; natural language understanding; segmented utterance; speech recognition; spontaneous emotional speech; spontaneous speech analysis; Audio databases; Audio recording; Emotion recognition; Humans; Information processing; Natural languages; Robustness; Speech analysis; Speech recognition; TV broadcasting; Data acquisition; Speech analysis; Speech processing; TV; Video signal processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo, 2008 IEEE International Conference on
  • Conference_Location
    Hannover
  • Print_ISBN
    978-1-4244-2570-9
  • Electronic_ISBN
    978-1-4244-2571-6
  • Type

    conf

  • DOI
    10.1109/ICME.2008.4607572
  • Filename
    4607572