DocumentCode :
730697
Title :
Multi-speaker modeling and speaker adaptation for DNN-based TTS synthesis
Author :
Yuchen Fan ; Yao Qian ; Soong, Frank K. ; Lei He
Author_Institution :
Microsoft Res. Asia, Beijing, China
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
4475
Lastpage :
4479
Abstract :
In DNN-based TTS synthesis, DNNs hidden layers can be viewed as deep transformation for linguistic features and the output layers as representation of acoustic space to regress the transformed linguistic features to acoustic parameters. The deep-layered architectures of DNN can not only represent highly-complex transformation compactly, but also take advantage of huge amount of training data. In this paper, we propose an approach to model multiple speakers TTS with a general DNN, where the same hidden layers are shared among different speakers while the output layers are composed of speaker-dependent nodes explaining the target of each speaker. The experimental results show that our approach can significantly improve the quality of synthesized speech objectively and subjectively, comparing with speech synthesized from the individual, speaker-dependent DNN-based TTS. We further transfer the hidden layers for a new speaker with limited training data and the resultant synthesized speech of the new speaker can also achieve a good quality in term of naturalness and speaker similarity.
Keywords :
neural nets; speech synthesis; DNN-based TTS synthesis; deep neural networks; deep-layered architectures; linguistic features; multispeaker modeling; speaker adaptation; speaker-dependent nodes; text-to-speech synthesis; Acoustics; Adaptation models; Hidden Markov models; Pragmatics; Speech; Training; Training data; deep neural networks; multi-task learning; statistical parametric speech synthesis; transfer learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7178817
Filename :
7178817
Link To Document :
بازگشت