Title :
Harmonics Plus Noise Model Based Vocoder for Statistical Parametric Speech Synthesis
Author :
Erro, Daniel ; Sainz, I. ; Navas, Eva ; Hernaez, Inma
Author_Institution :
AhoLab Signal Process. Lab., Univ. of the Basque Country (UPV/EHU), Bilbao, Spain
Abstract :
This article explores the potential of the harmonics plus noise model of speech in the development of a high-quality vocoder applicable in statistical frameworks, particularly in modern speech synthesizers. It presents an extensive explanation of all the different alternatives considered during the design of the HNM-based vocoder, together with the corresponding objective and subjective experiments, and a careful description of its implementation details. Three aspects of the analysis have been investigated: refinement of the pitch estimation using quasi-harmonic analysis, study and comparison of several spectral envelope analysis procedures, and strategies to analyze and model the maximum voiced frequency. The performance of the resulting vocoder is shown to be similar to that of state-of-the-art vocoders in synthesis tasks.
Keywords :
harmonic analysis; speech synthesis; statistical analysis; vocoders; HNM-based vocoder; harmonics plus noise model; maximum voiced frequency; objective experiments; pitch estimation; quasi-harmonic analysis; spectral envelope analysis procedures; statistical parametric speech synthesis; subjective experiments; Cepstral analysis; Estimation; Harmonic analysis; Interpolation; Noise; Speech; Vocoders; Harmonics plus noise model; statistical parametric speech synthesis; vocoder; voice transformation;
Journal_Title :
Selected Topics in Signal Processing, IEEE Journal of
DOI :
10.1109/JSTSP.2013.2283471