Title :
Speaker Adaptation Using Nonlinear Regression Techniques for HMM-Based Speech Synthesis
Author :
Doo Hwa Hong ; Shin Jae Kang ; Joun Yeop Lee ; Nam Soo Kim
Author_Institution :
Dept. of Electr. & Comput. Eng. & INMC, Seoul Nat. Univ., Seoul, South Korea
Abstract :
The maximum likelihood linear regression (MLLR) technique is a well-known approach to parameter adaptation in hidden Markov model (HMM)-based systems. In this paper, we propose the maximum penalized likelihood kernel regression (MPLKR) approach as a novel adaptation technique for HMM-based speech synthesis. The proposed algorithm performs a nonlinear regression between the mean vector of the base model and the corresponding mean vector of adaptive data by means of a kernel method. In the experiments, we used various types of parametric kernels for the proposed algorithm and compared their performances with the conventional method. From experimental results, it has been found that the proposed algorithm outperforms the conventional method in terms of the objective measure as well as the subjective listening quality.
Keywords :
hidden Markov models; maximum likelihood estimation; regression analysis; speech synthesis; HMM; MPLKR approach; hidden Markov model; maximum penalized likelihood kernel regression; mean vector; nonlinear regression techniques; parametric kernels; speaker adaptation; speech synthesis; subjective listening quality; Adaptation models; Feature extraction; Hidden Markov models; Kernel; Speech; Speech synthesis; Vectors; HMM-based speech synthesis; kernel; maximum likelihood linear regression (MLLR); maximum penalized likelihood kernel regression (MPLKR);
Conference_Titel :
Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP), 2014 Tenth International Conference on
Conference_Location :
Kitakyushu
Print_ISBN :
978-1-4799-5389-9
DOI :
10.1109/IIH-MSP.2014.152