DocumentCode :
591768
Title :
Effective sentence selection based on phone/model coverage maximization for speaker adaptation in HMM-based speech synthesis
Author :
Cheng Hsien Lin ; Po Kai Huang ; Cheng Yuan Lin ; Chih Chung Kuo
Author_Institution :
ITRI, Hsinchu, Taiwan
fYear :
2012
fDate :
5-8 Dec. 2012
Firstpage :
74
Lastpage :
78
Abstract :
Reducing the recording effort required in practical speaker adaptive text-to-speech applications would be very useful. In this paper, we present two sentence selection approaches based on a greedy algorithm; one is based on phone coverage and the other is based on model coverage. The former considers the phonetic information in speaker adaptation data, while the latter focuses on occurrences of Mel-cepstral and logF0 models in decision trees of the average voice model. To verify the efficacy of the proposed methods, we compare their performance with that of a random selection method in objective and subjective evaluations. The objective and subjective evaluation results demonstrate that both methods outperform the random selection method.
Keywords :
hidden Markov models; speech synthesis; HMM-based speech synthesis; Mel-cepstral models; logF0 models; model coverage; phone coverage; phone-model coverage maximization; random selection method; sentence selection; speaker adaptation data; speaker adaptive text-to-speech applications; Adaptation models; Data models; Greedy algorithms; Hidden Markov models; Speech; Speech synthesis; Training; HMM-based speech synthesis; greedy algorithm; model coverage; speaker adaptation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
Conference_Location :
Kowloon
Print_ISBN :
978-1-4673-2506-6
Electronic_ISBN :
978-1-4673-2505-9
Type :
conf
DOI :
10.1109/ISCSLP.2012.6423469
Filename :
6423469
Link To Document :
بازگشت