DocumentCode
3714243
Title
A straightforward method for calculating the voicing cut-off frequency for streaming HNM TTS
Author
J.A. Louw
Author_Institution
Human Language Technologies Research Group, Meraka Institute, CSIR, Pretoria, South Africa
fYear
2015
Firstpage
252
Lastpage
257
Abstract
The Harmonic plus Noise Model vocoder produces natural text-to-speech synthesis without some of the artifacts encountered in other vocoders. However, in order to achieve this naturalness one needs to determine a voicing cut-off frequency for each frame of speech being synthesized. This has proven to be a challenge and there are many methods and implementations, all with certain trade-offs. We present here a straightforward method, based on cepstral energy, that can also be used in streaming HNM TTS synthesis.
Keywords
"Voltage-controlled oscillators","Speech","Harmonic analysis","Cutoff frequency","Cepstral analysis","Hidden Markov models","Vocoders"
Publisher
ieee
Conference_Titel
Pattern Recognition Association of South Africa and Robotics and Mechatronics International Conference (PRASA-RobMech), 2015
Type
conf
DOI
10.1109/RoboMech.2015.7359531
Filename
7359531
Link To Document