Automatic Staging of Audio with Emotions

Author

Saheer, Lakshmi ; Cernak, M.

fYear

2013

fDate

2-5 Sept. 2013

Firstpage

705

Lastpage

706

Abstract

Present-day text-to-speech technologies are mature enough that their quality is acceptable to users. However, a large gap remains between synthesised speech and real human speech, because synthesised speech lacks expression and emotion. Geneemo is a technology for automatically adding emotions and expressions to any audio. Staging a text means dramatising it: the text is enriched and transformed into a performance. Similarly, "staging the audio" extends text dramatisation to audio by enriching emotionally neutral audio content into natural human speech with real expressions. The audio can be generated by any text-to-speech technology. The aim of the project is to make human-computer interaction as natural as possible through expressive speech. This also opens up a portfolio of applications that replace real human voices.

Keywords

emotion recognition; human computer interaction; speech synthesis; Geneemo; audio automatic staging; expressive speech; human computer interactions; text dramatisation; text-to-speech technology; Abstracts; Affective computing; Human computer interaction; Markov processes; Speech; Speech recognition; Speech synthesis

fLanguage

English

Publisher

IEEE

Conference_Title

Affective Computing and Intelligent Interaction (ACII), 2013 Humaine Association Conference on

Conference_Location

Geneva

ISSN

2156-8103

Type

conf

DOI

10.1109/ACII.2013.124

Filename

6681515