DocumentCode :
3146633
Title :
Generation method of Lanzhou dialect speech based on Gaussian Mixture Model
Author :
Zhen-ye, Gan ; Hong-zhi, Yu ; Hong-wu, Yang
Author_Institution :
Key Lab. of China Nat. Linguistic Inf. Technol., Northwest Univ. for Nat., Lanzhou, China
fYear :
2011
fDate :
16-18 April 2011
Firstpage :
4108
Lastpage :
4111
Abstract :
Dialect generation is one of the most important aspects of Chinese speech synthesis. Using the method of conversion prosodic features we can realize high quality speech synthesis. Firstly, A Lanzhou dialect corpus has been built based on "word-list in dialectal survey" for the generation of Lanzhou dialect. Speech corpus was recorded with contrastive (Lanzhou dialect vs. Mandarin) recordings. A pitch target model is introduced, which is optimized to describe feature parameters of the Mandarin speech and Lanzhou dialect speech in the training set of speech corpus. Secondly, the Gaussian Mixture Model (GMM) can map the subtle prosody distributions between Mandarin and Lanzhou dlialect speech, we train GMM conversion parameter in the training set, and get converted F0 contours of Lanzhou dialect speech by GMM conversion parameter. Using the converted Lanzhou dlialect F0 contours, we can generate high quality Lanzhou dlialect speech by STRAIGHT algorithm. Subjective experiments demonstrated that the generated speech achieve 4.06 of the average mean opinion score(MOS).
Keywords :
Gaussian processes; natural language processing; speech synthesis; Chinese speech synthesis; Gaussian mixture model; Lanzhou dialect speech; Lanzhou dlialect F0 contours; Mandarin dlialect speech; STRAIGHT algorithm; dialect generation method; mean opinion score; pitch target model; speech corpus; Acoustics; Mathematical model; Speech; Speech synthesis; Testing; Training; F0 contours; Gaussian Mixture Model; Lanzhou Dialect Speech; Speech Generation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Consumer Electronics, Communications and Networks (CECNet), 2011 International Conference on
Conference_Location :
XianNing
Print_ISBN :
978-1-61284-458-9
Type :
conf
DOI :
10.1109/CECNET.2011.5768162
Filename :
5768162
Link To Document :
بازگشت