Title :
VLSI Architecture Design for Concatenative Speech Synthesizer
Author :
Chu, Li-Ping ; Wang, Jia-Ching ; Wang, Jhing-Fa
Author_Institution :
Dept. of Electr. Eng., Nat. Cheng Kung Univ., Tainan
Abstract :
This paper presents a VLSI architecture for Mandarin speech synthesis. For the natural synthesized speech, subsyllable based synthesis units are recorded in advance. The synthesized speech is obtained by suitably concatenating the synthesis units. The TD-PSOLA (time domain pitch synchronous overlap-and-add) approach is used to perform the prosody modification. The proposed VLSI architecture includes two parts: the TD-PSOLA module and the synthesized pitch period generator. In the TD-PSOLA module, we also present a fast CORDIC architecture which is five times faster than the conventional method.
Keywords :
VLSI; digital arithmetic; natural language processing; speech synthesis; time-domain analysis; CORDIC architecture; Mandarin speech synthesis; TD-PSOLA module; VLSI architecture; concatenative speech synthesizer; natural synthesized speech; prosody modification; subsyllable based synthesis units; time domain pitch synchronous overlap-and-add approach; Design optimization; Frequency domain analysis; Hardware; Natural languages; Speech synthesis; Synthesizers; System-on-a-chip; Very large scale integration;
Conference_Titel :
TENCON 2005 2005 IEEE Region 10
Conference_Location :
Melbourne, Qld.
Print_ISBN :
0-7803-9311-2
Electronic_ISBN :
0-7803-9312-0
DOI :
10.1109/TENCON.2005.301087