• DocumentCode
    652805
  • Title

    Automatic Staging of Audio with Emotions

  • Author

    Saheer, Lakshmi ; Cernak, M.

  • fYear
    2013
  • fDate
    2-5 Sept. 2013
  • Firstpage
    705
  • Lastpage
    706
  • Abstract
    Current day text-to-speech technologies are mature enough to be acceptable in quality for the users. There is still a large gap between a synthesised speech and a real human speech due to lack of expressions and emotions. Geneemo is a technology for automatic addition of emotions and expressions to any audio. The process of staging the text is to dramatize it. The text is enriched and transformed into a performance. Similarly, "staging the audio" refers to extending text dramatisation to audio by enriching emotionally neutral audio content into a natural human speech with real expressions. The audio can be generated by any text-to-speech technology. The aim of the project is to make human computer interactions as natural as possible with expressive speech. This also opens up a portfolio of applications replacing real human voices.
  • Keywords
    emotion recognition; human computer interaction; speech synthesis; Geneemo; audio automatic staging; expressive speech; human computer interactions; text dramatisation; text-to-speech technology; Abstracts; Affective computing; Human computer interaction; Markov processes; Speech; Speech recognition; Speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Affective Computing and Intelligent Interaction (ACII), 2013 Humaine Association Conference on
  • Conference_Location
    Geneva
  • ISSN
    2156-8103
  • Type

    conf

  • DOI
    10.1109/ACII.2013.124
  • Filename
    6681515