Title :
Robust generation of symbolic prosody by a neural classifier based on autoassociators
Author :
Muller, A.F. ; Zimmermann, Hans Georg ; Neuneier, Ralph
Author_Institution :
Siemens Corp., Munich, Germany
Abstract :
In this paper a highly robust method to predict symbolic prosody labels for speech synthesis is proposed. This method is based on a two stage approach. In the first stage the characteristics of each symbolic prosody label are captured by autoassociative models, which are trained independently. In the second stage detailed error information obtained from the different autoassociative models is used to train a neural classifier yielding class conditional probabilities. The method has been successfully applied for German and English language. For the latter the exact same data bases as used by Black and Taylor (1997) and Ostendorf and Veilleux (1994) were used to test the method. The results obtained are superior to those reported for the HMM-based and CART-based approaches. Further experiments also demonstrate considerable generalizing ability, yielding high robustness in sparse training material conditions
Keywords :
learning (artificial intelligence); neural nets; probability; speech synthesis; English language; German language; autoassociators; class conditional probabilities; error information; neural classifier; robust generation; speech synthesis; symbolic prosody; training; two stage approach; Classification algorithms; Costs; Hidden Markov models; History; Natural languages; Neural networks; Regression tree analysis; Robustness; Speech synthesis; Testing;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
Print_ISBN :
0-7803-6293-4
DOI :
10.1109/ICASSP.2000.861812