مرکز منطقه ای اطلاع رساني علوم و فناوري - VLSI Architecture Design for Concatenative Speech Synthesizer

DocumentCode :

3383961

Title :

VLSI Architecture Design for Concatenative Speech Synthesizer

Author :

Chu, Li-Ping ; Wang, Jia-Ching ; Wang, Jhing-Fa

Author_Institution :

Dept. of Electr. Eng., Nat. Cheng Kung Univ., Tainan

fYear :

2005

fDate :

21-24 Nov. 2005

Firstpage :

Lastpage :

Abstract :

This paper presents a VLSI architecture for Mandarin speech synthesis. For the natural synthesized speech, subsyllable based synthesis units are recorded in advance. The synthesized speech is obtained by suitably concatenating the synthesis units. The TD-PSOLA (time domain pitch synchronous overlap-and-add) approach is used to perform the prosody modification. The proposed VLSI architecture includes two parts: the TD-PSOLA module and the synthesized pitch period generator. In the TD-PSOLA module, we also present a fast CORDIC architecture which is five times faster than the conventional method.

Keywords :

VLSI; digital arithmetic; natural language processing; speech synthesis; time-domain analysis; CORDIC architecture; Mandarin speech synthesis; TD-PSOLA module; VLSI architecture; concatenative speech synthesizer; natural synthesized speech; prosody modification; subsyllable based synthesis units; time domain pitch synchronous overlap-and-add approach; Design optimization; Frequency domain analysis; Hardware; Natural languages; Speech synthesis; Synthesizers; System-on-a-chip; Very large scale integration;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

TENCON 2005 2005 IEEE Region 10

Conference_Location :

Melbourne, Qld.

Print_ISBN :

0-7803-9311-2

Electronic_ISBN :

0-7803-9312-0

Type :

conf

DOI :

10.1109/TENCON.2005.301087

Filename :

4085276

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3383961