DocumentCode
63991
Title
Maximum Voiced Frequency Estimation: Exploiting Amplitude and Phase Spectra
Author
Drugman, Thomas; Stylianou, Yannis
Author_Institution
Toshiba Cambridge Res. Lab., Cambridge, UK
Volume
21
Issue
10
fYear
2014
fDate
Oct. 2014
Firstpage
1230
Lastpage
1234
Abstract
Maximum Voiced Frequency (MVF) is used in various speech models as the spectral boundary separating periodic and aperiodic components during the production of voiced sounds. Recent studies have shown that its proper estimation and modeling enhance the quality of statistical parametric speech synthesizers. Contrastingly, these same methods of MVF estimation have been reported to degrade the performance of singing voice synthesizers. This paper proposes a new approach for MVF estimation which exploits both amplitude and phase spectra. It is shown that phase conveys relevant information about the harmonicity of the voice signal, and that it can be jointly used with features derived from the amplitude spectrum. This information is further integrated into a maximum likelihood criterion which provides a decision about the MVF estimate. The proposed technique is compared to two state-of-the-art methods, and shows a superior performance in both objective and subjective evaluations. Perceptual tests indicate a drastic improvement in high-pitched voices.
Keywords
maximum likelihood estimation; speech synthesis; MVF estimation; amplitude spectra; maximum likelihood criterion; maximum voiced frequency estimation; phase spectra; singing voice synthesizers; spectral boundary; statistical parametric speech synthesizers; Frequency estimation; Harmonic analysis; Hidden Markov models; Maximum likelihood estimation; Noise; Speech; High-pitched; maximum voiced frequency; phase processing; singing voice; speech analysis; speech synthesis
fLanguage
English
Journal_Title
Signal Processing Letters, IEEE
Publisher
IEEE
ISSN
1070-9908
Type
jour
DOI
10.1109/LSP.2014.2332186
Filename
6840999