DocumentCode :
661516
Title :
Realizing Tibetan speech synthesis by speaker adaptive training
Author :
Hong-wu Yang ; Oura, Keiichiro ; Zhen-ye Gan ; Tokuda, Keiichi
Author_Institution :
Coll. of Phys. & Electron. Eng., Northwest Normal Univ., Lanzhou, China
fYear :
2013
fDate :
Oct. 29 2013-Nov. 1 2013
Firstpage :
1
Lastpage :
4
Abstract :
This paper presents a method to realize HMM-based Tibetan speech synthesis using a Mandarin speech synthesis framework. A Mandarin context-dependent label format is adopted to label Tibetan sentences. A Mandarin question set is also extended for Tibetan by adding language-specific questions. A Mandarin speech synthesis framework is utilized to train an average mixed-lingual model from a large Mandarin multi-speaker-based corpus and a small Tibetan one-speaker-based corpus using the speaker adaptive training. Then the speaker adaptation transformation is applied to the average mixed-lingual model to obtain a speaker adapted Tibetan model. Experimental results show that this method outperforms the method using speaker dependent Tibetan model when only a small amount of training Tibetan utterances are available. When the number of training Tibetan utterances is increased, the performances of the two methods tend to be the same.
Keywords :
hidden Markov models; natural language processing; speech synthesis; HMM-based Tibetan speech synthesis; Mandarin context-dependent label format; Mandarin multispeaker-based corpus; Mandarin question set; Mandarin speech synthesis framework; Tibetan one-speaker-based corpus; Tibetan sentence labeling; Tibetan utterances; average mixed-lingual model; hidden Markov model; language-specific questions; multilingual speech synthesis; speaker adaptive training; Adaptation models; Databases; Hidden Markov models; Silicon; Speech; Speech synthesis; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2013 Asia-Pacific
Conference_Location :
Kaohsiung
Type :
conf
DOI :
10.1109/APSIPA.2013.6694379
Filename :
6694379
Link To Document :
بازگشت