Title :
The research and implementation of acoustic module based Mandarin TTS
Author :
Yeh, Cheng-Yu ; Chen, Kuan-Lin
Author_Institution :
Dept. of Electr. Eng., Nat. Chin-Yi Univ. of Technol., Taichung, Taiwan
Abstract :
The primary study of this paper is focused on the acoustic module (AM) design in order to improve the performance of Mandarin TTS system. The AM is composed of the prosody generator, the spectrum generator, and the speech synthesizer. The HMM, recurrent neural network (RNN), and PSOLA algorithms are employed to build the AM. Finally, the performance analyses including the speech quality, memory requirement, and computational complexity are examined in our system. Smaller than 2.4 MB memory space and average 0.08 MIPS for each syllable can be achieved on the fixed-point DSP chip. Also the synthesized speech sounds very good.
Keywords :
acoustic signal processing; hidden Markov models; recurrent neural nets; HMM; Mandarin TTS system; PSOLA algorithms; acoustic module design; computational complexity; fixed-point DSP chip; pitch-synchronous overlap-add approach; prosody generator; recurrent neural network; spectrum generator; speech quality; speech synthesizer; Data mining; Natural languages; Neural networks; Process control; Recurrent neural networks; Signal processing algorithms; Speech analysis; Speech synthesis; Synthesizers; Text analysis;
Conference_Titel :
Communications, Control and Signal Processing (ISCCSP), 2010 4th International Symposium on
Conference_Location :
Limassol
Print_ISBN :
978-1-4244-6285-8
DOI :
10.1109/ISCCSP.2010.5463382