• DocumentCode
    359231
  • Title

    Prosodic enhancements for a musical object oriented formant synthesizer

  • Author

    Politis, Dionysios

  • Author_Institution
    Multimedia Lab., Aristotelian Univ. of Thessaloniki, Greece
  • Volume
    2
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    702
  • Abstract
    This paper describes a real time object oriented formant synthesizer with musical capabilities. These capabilities originate from the support of the SKDNL and MIDI specifications. Apart from the music protocols, the synthesizer uses quantized variability for the fundamental frequency of the uttered phonemes in order to accurately depict intonation characteristics. The desired performance of the synthesizer is evaluated on the grounds of ´naturalness´ and prosodic ´exactness´. The former has to do with the depth of analysis for the synthesized phonemes and the latter with the ability to describe intonation phenomena traced in the context of the supported languages. Thus far, the synthesizer can perform sufficiently in English (US) and Greek. In order to describe phenomena like petasi or vareia (accent of grave tone) that have been detected in Greek vocal forms, the synthesizer is equipped with a user interface that describes these phenomena, compiles and quantizes them in pitch-duration vectors and feeds in the synthesizer with the adequate information in order to produce as natural as possible utterances of synthetic speech.
  • Keywords
    computer interfaces; electronic music; music; object-oriented methods; speech enhancement; speech synthesis; English; Greek; MIDI specifications; SKDNL specifications; fundamental frequency; grave tone accent; intonation characteristics; music protocols; musical object oriented formant synthesizer; performance evaluation; petasi; phonemes; pitch-duration vectors; prosodic enhancements; real time object oriented formant synthesizer; synthetic speech; user interface; vareia; Feeds; Frequency synthesizers; Instruments; Natural languages; Protocols; Psychoacoustic models; Signal synthesis; Speech synthesis; Timing; User interfaces;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrotechnical Conference, 2000. MELECON 2000. 10th Mediterranean
  • Print_ISBN
    0-7803-6290-X
  • Type

    conf

  • DOI
    10.1109/MELCON.2000.880030
  • Filename
    880030