A straightforward method for calculating the voicing cut-off frequency for streaming HNM TTS

Author

J.A. Louw

Author_Institution

Human Language Technologies Research Group, Meraka Institute, CSIR, Pretoria, South Africa

fYear

2015

Firstpage

252

Lastpage

257

Abstract

The Harmonic plus Noise Model vocoder produces natural text-to-speech synthesis without some of the artifacts encountered in other vocoders. However, in order to achieve this naturalness one needs to determine a voicing cut-off frequency for each frame of speech being synthesized. This has proven to be a challenge and there are many methods and implementations, all with certain trade-offs. We present here a straightforward method, based on cepstral energy, that can also be used in streaming HNM TTS synthesis.

Keywords

"Voltage-controlled oscillators","Speech","Harmonic analysis","Cutoff frequency","Cepstral analysis","Hidden Markov models","Vocoders"

Publisher

ieee

Conference_Titel

Pattern Recognition Association of South Africa and Robotics and Mechatronics International Conference (PRASA-RobMech), 2015

Type

conf

DOI

10.1109/RoboMech.2015.7359531

Filename

7359531

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=3714243