• DocumentCode
    2755983
  • Title

    A novel quasi-diphone inventory approach to Text-To-Speech synthesis

  • Author

    Gerazov, Branislav ; Shutinoski, Goce ; Arsov, Goce

  • Author_Institution
    Dept. of Electron., Univ. of Ss. Cyril & Methodius, Skopje
  • fYear
    2008
  • fDate
    5-7 May 2008
  • Firstpage
    799
  • Lastpage
    804
  • Abstract
    The paper presents a novel approach to concatenative text-to-speech synthesis. The system uses a unique optimized mixed-rank inventory, based on a modification of the classical diphone concept. A new unit type is introduced in our work, dubbed the quasi-diphone unit. A set of these units is designed to cover all the critical transitions between phones and at the same time to be compatible with phone-length units for concatenation purposes. This allows for inventory optimization with respect to its size and the quality of the generated speech. The system includes elementary pitch, duration and amplitude modeling implemented with the standard PSOLA algorithm. Presented results show that it was possible to achieve full intelligibility and reasonable naturalness whilst maintaining a rather small inventory. The system was specially developed for the synthesis of Macedonian, and is the first HQ TTS system for this language. Using the developed standardized interface between the modules, the system is also applicable to any of the world's languages.
  • Keywords
    natural language processing; speech synthesis; Macedonian; amplitude modeling; elementary pitch; quasi-diphone inventory approach; text-to-speech synthesis; time domain pitch-synchronous overlap add algorithm; Analog computers; Helium; History; Human voice; Information technology; Natural languages; Spectrogram; Speech processing; Synthesizers; TTS; concatenative synthesis; mixed-rank inventory; quasi-diphone
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Electrotechnical Conference, 2008. MELECON 2008. The 14th IEEE Mediterranean
  • Conference_Location
    Ajaccio
  • Print_ISBN
    978-1-4244-1632-5
  • Electronic_ISBN
    978-1-4244-1633-2
  • Type

    conf

  • DOI
    10.1109/MELCON.2008.4618533
  • Filename
    4618533