• DocumentCode
    243292
  • Title
    Incorporation of happiness into neutral speech by modifying emotive-keywords
  • Author
    Rachel, G. Anushiya; Sreenidhi, S.; Vijayalakshmi, P.; Nagarajan, T.
  • Author_Institution
    Speech Lab., SSN Coll. of Eng., Chennai, India
  • fYear
    2014
  • fDate
    22-25 Oct. 2014
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Text-to-speech synthesis systems are expected to produce speech that is both intelligible and natural. Conventional systems produce highly intelligible speech, but naturalness still needs improvement: regardless of context, any given text is synthesized in a neutral tone. Many existing techniques for synthesizing emotional speech are data driven, but collecting a large amount of emotional data is tedious. Signal processing algorithms can instead be used to modify neutral speech. The current work concentrates on incorporating happiness into neutral speech. Analysis reveals that happiness primarily affects the pitch contour and the intensity of speech, and that variations in these features are observed predominantly in the emotive-keywords. Neutral speech is therefore transformed into happy speech by using signal processing algorithms to modify the pitch and intensity of the emotive-keywords. When assessed subjectively, the happy speech synthesized by the proposed method yields a mean opinion score of 2.53 out of a possible 3. The synthetic speech is also assessed objectively using a GMM-based emotion recognition system, and all the tested sentences are recognized as happy.
  • Keywords
    Gaussian processes; speech synthesis; GMM-based emotion recognition system; Gaussian mixture model; emotional speech synthesis; emotive-keywords modification; happiness; neutral speech modification; pitch contour; signal processing algorithms; speech intensity; speech production; text-to-speech synthesis system; Databases; Hidden Markov models; Polynomials; Signal processing algorithms; Speech; Speech processing; Speech recognition
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    TENCON 2014 - 2014 IEEE Region 10 Conference
  • Conference_Location
    Bangkok
  • ISSN
    2159-3442
  • Print_ISBN
    978-1-4799-4076-9
  • Type
    conf
  • DOI
    10.1109/TENCON.2014.7022458
  • Filename
    7022458
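
The abstract describes changing pitch and intensity only within the emotive-keywords. The abstract does not give the paper's actual algorithms, so the following is only a minimal sketch of that idea, working on frame-level pitch (F0) and intensity contours. The function name `modify_keyword_frames`, the scale factors, and the convention that unvoiced frames have F0 = 0 are assumptions made for illustration. Resynthesizing a waveform from the modified contours is not covered.

```python
import numpy as np


def modify_keyword_frames(f0_hz, intensity, keyword_spans,
                          pitch_scale=1.25, gain=2.0):
    """Scale F0 and intensity only inside keyword spans.

    keyword_spans holds (start, end) frame indices, end exclusive.
    pitch_scale and gain are hypothetical values, not taken from the paper.
    """
    f0_out = np.asarray(f0_hz, dtype=float).copy()
    int_out = np.asarray(intensity, dtype=float).copy()
    for start, end in keyword_spans:
        segment = f0_out[start:end]      # basic slice: a view into f0_out
        voiced = segment > 0             # assumed: unvoiced frames have F0 = 0
        segment[voiced] *= pitch_scale   # raise pitch on voiced frames only
        int_out[start:end] *= gain       # boost intensity over the whole span
    return f0_out, int_out


# Example: frames 2..4 belong to a keyword; all other frames stay neutral.
f0_new, intensity_new = modify_keyword_frames(
    [0, 100, 100, 200, 200, 0], [1, 1, 1, 1, 1, 1], [(2, 5)]
)
```

Frames outside the keyword spans are returned unchanged, which mirrors the observation in the abstract that the variations are concentrated in the emotive-keywords.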