DocumentCode
652805
Title
Automatic Staging of Audio with Emotions
Author
Saheer, Lakshmi ; Cernak, M.
fYear
2013
fDate
2-5 Sept. 2013
Firstpage
705
Lastpage
706
Abstract
Current text-to-speech technologies are mature enough to deliver quality that users find acceptable. However, there is still a large gap between synthesised speech and real human speech, owing to a lack of expression and emotion. Geneemo is a technology for automatically adding emotions and expressions to any audio. Staging a text means dramatising it: the text is enriched and transformed into a performance. Similarly, "staging the audio" extends text dramatisation to audio by enriching emotionally neutral audio content into natural human speech with real expressions. The audio can be generated by any text-to-speech technology. The aim of the project is to make human-computer interaction as natural as possible through expressive speech. This also opens up a portfolio of applications that replace real human voices.
Keywords
emotion recognition; human computer interaction; speech synthesis; Geneemo; audio automatic staging; expressive speech; human computer interactions; text dramatisation; text-to-speech technology; Abstracts; Affective computing; Human computer interaction; Markov processes; Speech; Speech recognition; Speech synthesis
fLanguage
English
Publisher
IEEE
Conference_Titel
Affective Computing and Intelligent Interaction (ACII), 2013 Humaine Association Conference on
Conference_Location
Geneva
ISSN
2156-8103
Type
conf
DOI
10.1109/ACII.2013.124
Filename
6681515