Title :
Generation method of Lanzhou dialect speech based on Gaussian Mixture Model
Author :
Zhen-ye, Gan ; Hong-zhi, Yu ; Hong-wu, Yang
Author_Institution :
Key Lab. of China Nat. Linguistic Inf. Technol., Northwest Univ. for Nat., Lanzhou, China
Abstract :
Dialect generation is one of the most important aspects of Chinese speech synthesis. Using the method of conversion prosodic features we can realize high quality speech synthesis. Firstly, A Lanzhou dialect corpus has been built based on "word-list in dialectal survey" for the generation of Lanzhou dialect. Speech corpus was recorded with contrastive (Lanzhou dialect vs. Mandarin) recordings. A pitch target model is introduced, which is optimized to describe feature parameters of the Mandarin speech and Lanzhou dialect speech in the training set of speech corpus. Secondly, the Gaussian Mixture Model (GMM) can map the subtle prosody distributions between Mandarin and Lanzhou dlialect speech, we train GMM conversion parameter in the training set, and get converted F0 contours of Lanzhou dialect speech by GMM conversion parameter. Using the converted Lanzhou dlialect F0 contours, we can generate high quality Lanzhou dlialect speech by STRAIGHT algorithm. Subjective experiments demonstrated that the generated speech achieve 4.06 of the average mean opinion score(MOS).
Keywords :
Gaussian processes; natural language processing; speech synthesis; Chinese speech synthesis; Gaussian mixture model; Lanzhou dialect speech; Lanzhou dlialect F0 contours; Mandarin dlialect speech; STRAIGHT algorithm; dialect generation method; mean opinion score; pitch target model; speech corpus; Acoustics; Mathematical model; Speech; Speech synthesis; Testing; Training; F0 contours; Gaussian Mixture Model; Lanzhou Dialect Speech; Speech Generation;
Conference_Titel :
Consumer Electronics, Communications and Networks (CECNet), 2011 International Conference on
Conference_Location :
XianNing
Print_ISBN :
978-1-61284-458-9
DOI :
10.1109/CECNET.2011.5768162