DocumentCode
3059438
Title
Speech synthesis in the time domain from text
Author
Grossmann, E.
Author_Institution
Heinrich-Hertz-Institut Für Nachrichtentechnik, Berlin, W.Germany
Volume
7
fYear
1982
fDate
30072
Firstpage
936
Lastpage
939
Abstract
A very efficient method to synthesize speech is the combination of digitally stored monophones and transients in the time domain, because of its good intellegibility and the easy implementation which allows a very inexpensive realisation. A disadvantage inherent in procedures in the time domain has so far been the fact, that they required a very large sized memory, the size of which increased in multiples if prosodic parameters were also taken into account. We developed a system which synthesizes an unlimited vocabulary with a memory of only 22 kBytes (8-Bit Bytes). Further investigations showed, that by means of the linear prediction method it is possible to control the fundamental frequency of the speech signal in a wide range without storing additional speech segments. In addition to this work, we developed a system to transform orthografic text into a phoneme string automatically. We optimized this algorithm for the 8000 most frequent words of the German language. The whole system which is implemented on a microprozessor, is placed on a single board, with a storage of total 32 kBytes (8-Bit Bytes).
Keywords
Control system synthesis; Explosives; Frequency; Natural languages; Prediction methods; Signal analysis; Signal synthesis; Speech synthesis; Vocabulary; Vocoders;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '82.
Type
conf
DOI
10.1109/ICASSP.1982.1171863
Filename
1171863
Link To Document