DocumentCode :
25280
Title :
Harmonics Plus Noise Model Based Vocoder for Statistical Parametric Speech Synthesis
Author :
Erro, Daniel ; Sainz, I. ; Navas, Eva ; Hernaez, Inma
Author_Institution :
AhoLab Signal Process. Lab., Univ. of the Basque Country (UPV/EHU), Bilbao, Spain
Volume :
8
Issue :
2
fYear :
2014
fDate :
Apr-14
Firstpage :
184
Lastpage :
194
Abstract :
This article explores the potential of the harmonics plus noise model of speech in the development of a high-quality vocoder applicable in statistical frameworks, particularly in modern speech synthesizers. It presents an extensive explanation of all the different alternatives considered during the design of the HNM-based vocoder, together with the corresponding objective and subjective experiments, and a careful description of its implementation details. Three aspects of the analysis have been investigated: refinement of the pitch estimation using quasi-harmonic analysis, study and comparison of several spectral envelope analysis procedures, and strategies to analyze and model the maximum voiced frequency. The performance of the resulting vocoder is shown to be similar to that of state-of-the-art vocoders in synthesis tasks.
Keywords :
harmonic analysis; speech synthesis; statistical analysis; vocoders; HNM-based vocoder; harmonics plus noise model; maximum voiced frequency; objective experiments; pitch estimation; quasi-harmonic analysis; spectral envelope analysis procedures; statistical parametric speech synthesis; subjective experiments; Cepstral analysis; Estimation; Harmonic analysis; Interpolation; Noise; Speech; Vocoders; Harmonics plus noise model; statistical parametric speech synthesis; vocoder; voice transformation;
fLanguage :
English
Journal_Title :
Selected Topics in Signal Processing, IEEE Journal of
Publisher :
ieee
ISSN :
1932-4553
Type :
jour
DOI :
10.1109/JSTSP.2013.2283471
Filename :
6609074
Link To Document :
بازگشت