Title :
Speech synthesis using warped linear prediction and neural networks
Author :
Karjalainen, Matti ; Altosaar, Toomas ; Vainio, Martti
Author_Institution :
Lab. of Acoust. & Audio Signal Process., Helsinki Univ. of Technol., Espoo, Finland
Abstract :
A text-to-speech synthesis technique, based on warped linear prediction (WLP) and neural networks, is presented for high-quality individual sounding synthetic speech. Warped linear prediction is used as a speech production model with wide audio bandwidth yet with highly compressed control parameter data. An excitation codebook, inverse filtered from a target speaker´s voice, is applied to obtain individual tone quality. A set of neural networks, specialized to yield synthesis control parameters from phonemic input in specific contexts, generate the detailed parametric controls of WLP. Neural nets are also used successfully to compute the prosodic parameters. We have applied this approach in prototyping highly improved text-to-speech synthesis for the Finnish language
Keywords :
data compression; filtering theory; inverse problems; learning (artificial intelligence); linear predictive coding; neural nets; speech coding; speech intelligibility; speech synthesis; Finnish language; compressed control parameter data; excitation codebook; inverse filtered codebook; network training; neural networks; phonemic input; prosodic parameters; speech coding; synthesis control parameters; synthetic speech; text-to-speech synthesis; tone quality; warped linear prediction; Bandwidth; Control system synthesis; Humans; Network synthesis; Neural networks; Nonlinear filters; Predictive models; Sampling methods; Speech processing; Speech synthesis;
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7803-4428-6
DOI :
10.1109/ICASSP.1998.675405