DocumentCode :
2347951
Title :
The research and implementation of acoustic module based Mandarin TTS
Author :
Yeh, Cheng-Yu ; Chen, Kuan-Lin
Author_Institution :
Dept. of Electr. Eng., Nat. Chin-Yi Univ. of Technol., Taichung, Taiwan
fYear :
2010
fDate :
3-5 March 2010
Firstpage :
1
Lastpage :
4
Abstract :
This paper focuses on the design of the acoustic module (AM) to improve the performance of a Mandarin TTS system. The AM consists of a prosody generator, a spectrum generator, and a speech synthesizer, and is built using HMM, recurrent neural network (RNN), and PSOLA algorithms. The system is evaluated in terms of speech quality, memory requirement, and computational complexity. On a fixed-point DSP chip, it requires less than 2.4 MB of memory and an average of 0.08 MIPS per syllable. The synthesized speech is also reported to sound very good.
Keywords :
acoustic signal processing; hidden Markov models; recurrent neural nets; HMM; Mandarin TTS system; PSOLA algorithms; acoustic module design; computational complexity; fixed-point DSP chip; pitch-synchronous overlap-add approach; prosody generator; recurrent neural network; spectrum generator; speech quality; speech synthesizer; Data mining; Natural languages; Neural networks; Process control; Recurrent neural networks; Signal processing algorithms; Speech analysis; Speech synthesis; Synthesizers; Text analysis;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Communications, Control and Signal Processing (ISCCSP), 2010 4th International Symposium on
Conference_Location :
Limassol
Print_ISBN :
978-1-4244-6285-8
Type :
conf
DOI :
10.1109/ISCCSP.2010.5463382
Filename :
5463382