Speech synthesis in the time domain from text

Author

Grossmann, E.

Author_Institution

Heinrich-Hertz-Institut Für Nachrichtentechnik, Berlin, W.Germany

Volume

7

fYear

1982

fDate

30072

Firstpage

936

Lastpage

939

Abstract

A very efficient method to synthesize speech is the combination of digitally stored monophones and transients in the time domain, because of its good intellegibility and the easy implementation which allows a very inexpensive realisation. A disadvantage inherent in procedures in the time domain has so far been the fact, that they required a very large sized memory, the size of which increased in multiples if prosodic parameters were also taken into account. We developed a system which synthesizes an unlimited vocabulary with a memory of only 22 kBytes (8-Bit Bytes). Further investigations showed, that by means of the linear prediction method it is possible to control the fundamental frequency of the speech signal in a wide range without storing additional speech segments. In addition to this work, we developed a system to transform orthografic text into a phoneme string automatically. We optimized this algorithm for the 8000 most frequent words of the German language. The whole system which is implemented on a microprozessor, is placed on a single board, with a storage of total 32 kBytes (8-Bit Bytes).

Keywords

Control system synthesis; Explosives; Frequency; Natural languages; Prediction methods; Signal analysis; Signal synthesis; Speech synthesis; Vocabulary; Vocoders;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '82.

Type

conf

DOI

10.1109/ICASSP.1982.1171863

Filename

1171863