Regional Information Center for Science and Technology (RICEST) - The research and implementation of acoustic module based Mandarin TTS

DocumentCode :

2347951

Title :

The research and implementation of acoustic module based Mandarin TTS

Author :

Yeh, Cheng-Yu ; Chen, Kuan-Lin

Author_Institution :

Dept. of Electr. Eng., Nat. Chin-Yi Univ. of Technol., Taichung, Taiwan

fYear :

2010

fDate :

3-5 March 2010

Firstpage :

Lastpage :

Abstract :

This paper focuses on the design of the acoustic module (AM) to improve the performance of a Mandarin TTS system. The AM is composed of a prosody generator, a spectrum generator, and a speech synthesizer. HMM, recurrent neural network (RNN), and PSOLA algorithms are employed to build the AM. Finally, the system is evaluated in terms of speech quality, memory requirement, and computational complexity. On a fixed-point DSP chip, the system requires less than 2.4 MB of memory and an average of 0.08 MIPS per syllable. The synthesized speech also sounds very good.

Keywords :

acoustic signal processing; hidden Markov models; recurrent neural nets; HMM; Mandarin TTS system; PSOLA algorithms; acoustic module design; computational complexity; fixed-point DSP chip; pitch-synchronous overlap-add approach; prosody generator; recurrent neural network; spectrum generator; speech quality; speech synthesizer; Data mining; Natural languages; Neural networks; Process control; Recurrent neural networks; Signal processing algorithms; Speech analysis; Speech synthesis; Synthesizers; Text analysis;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Communications, Control and Signal Processing (ISCCSP), 2010 4th International Symposium on

Conference_Location :

Limassol

Print_ISBN :

978-1-4244-6285-8

Type :

conf

DOI :

10.1109/ISCCSP.2010.5463382

Filename :

5463382

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2347951