Title :
Dictionary-learning-based post-filter for HMM-based speech synthesis
Author :
Praneeth Kurpad Narayanamurthy;Chandra Sekhar Seelamantula
Author_Institution :
Department of Electrical Engineering, Indian Institute of Science Bangalore - 560 012, India
Abstract :
Oversmoothing of speech parameter trajectories is one of the causes for quality degradation of HMM-based speech synthesis. Various methods have been proposed to overcome this effect, the most recent ones being global variance (GV) and modulation-spectrum-based post-filter (MSPF). However, there is still a significant quality gap between natural and synthesized speech. In this paper, we propose a two-fold post-filtering technique to alleviate to a certain extent the oversmoothing of spectral and excitation parameter trajectories of HMM-based speech synthesis. For the spectral parameters, we propose a sparse-coding-based post-filter to match the trajectories of synthetic speech to that of natural speech, and for the excitation trajectory, we introduce a perceptually motivated post-filter. Experimental evaluations show quality improvement compared with existing methods.
Keywords :
"Hidden Markov models","Speech","Trajectory","Natural languages","Dictionaries","Speech synthesis","Training data"
Conference_Titel :
TENCON 2015 - 2015 IEEE Region 10 Conference
Print_ISBN :
978-1-4799-8639-2
Electronic_ISBN :
2159-3450
DOI :
10.1109/TENCON.2015.7373091