DocumentCode :
1687507
Title :
HMM-based speech synthesis adaptation using noisy data: Analysis and evaluation methods
Author :
Karhila, Reima ; Remes, Ulpu ; Kurimo, Mikko
Author_Institution :
Sch. of Sci., Dept. of Inf. & Comput. Sci., Aalto Univ., Aalto, Finland
fYear :
2013
Firstpage :
6930
Lastpage :
6934
Abstract :
This paper investigates the role of noise in speaker-adaptation of HMM-based text-to-speech (TTS) synthesis and presents a new evaluation procedure. Both a new listening test based on ITU-T recommendation 835 and a perceptually motivated objective measure, frequency-weighted segmental SNR, improve the evaluation of synthetic speech when noise is present. The evaluation of voices adapted with noisy data show that the noise plays a relatively small but noticeable role in the quality of synthetic speech: Naturalness and speaker similarity are not affected in a significant way by the noise, but listeners prefer the voices trained from cleaner data. Noise removal, even when it degrades natural speech quality, improves the synthetic voice.
Keywords :
hidden Markov models; signal denoising; speaker recognition; speech synthesis; HMM-based speech synthesis adaptation; ITU-T recommendation 835; TTS synthesis; analysis methods; evaluation methods; frequency-weighted segmental SNR; listening test; natural speech quality; noise removal; noisy data; perceptually motivated objective measure; speaker-adaptation; synthetic voice; text-to-speech synthesis; Hidden Markov models; Noise measurement; Signal to noise ratio; Speech; Speech enhancement; Speech synthesis; Adaptation; Evaluation; Feature extraction; Noise robustness; Speech Synthesis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6639005
Filename :
6639005
Link To Document :
بازگشت