مرکز منطقه ای اطلاع رساني علوم و فناوري - A Japanese TTS system based on multiform units and a speech modification algorithm with harmonics reconstruction

DocumentCode :

1416837

Title :

A Japanese TTS system based on multiform units and a speech modification algorithm with harmonics reconstruction

Author :

Takano, Satoshi ; Tanaka, Kimihito ; Mizuno, Hideyuki ; Abe, Masanobu ; Nakajima, Shinya

Author_Institution :

Cyber Space Labs., NTT, Kanagawa, Japan

Volume :

Issue :

fYear :

2001

fDate :

1/1/2001 12:00:00 AM

Firstpage :

Lastpage :

Abstract :

This paper proposes a new text-to-speech (TTS) system that utilizes large numbers of speech segments to produce very natural and intelligible synthetic speech. There are two innovations; new multiform synthesis units and a new speech modification algorithm based on a vocoder that offers harmonics reconstruction. The multiform units make it possible to reduce acoustic discontinuities at concatenation points and unnatural sound by preparing synthesis units with various lengths and various F₀ contours. The new speech modification algorithm, on the other hand, improves the quality of prosody modified speech. This algorithm is extremely effective in synthesizing speech whose prosodic parameters are quite different from those of synthesis units. Listening tests confirm that the new synthesis units yield speech with high intelligibility and naturalness, and that the new speech modification algorithm is superior to all other conventional vocoders and waveform domain algorithms including TD-PSOLA, especially when modifying the F₀ frequency upward

Keywords :

speech intelligibility; speech synthesis; Japanese TTS system; acoustic discontinuities; concatenation points; harmonics reconstruction; intelligibility; multiform synthesis units; multiform units; naturalness; prosody modified speech; speech modification algorithm; speech segments; text-to-speech synthesis system; unnatural sound; vocoder; Degradation; Frequency synthesizers; Natural languages; Spectral analysis; Speech synthesis; Technological innovation; Telegraphy; Telephony; Testing; Vocoders;

fLanguage :

English

Journal_Title :

Speech and Audio Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1063-6676

Type :

jour

DOI :

10.1109/89.890065

Filename :

890065

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1416837