• DocumentCode
    3059438
  • Title

    Speech synthesis in the time domain from text

  • Author

    Grossmann, E.

  • Author_Institution
    Heinrich-Hertz-Institut Für Nachrichtentechnik, Berlin, W.Germany
  • Volume
    7
  • fYear
    1982
  • fDate
    30072
  • Firstpage
    936
  • Lastpage
    939
  • Abstract
    A very efficient method to synthesize speech is the combination of digitally stored monophones and transients in the time domain, because of its good intellegibility and the easy implementation which allows a very inexpensive realisation. A disadvantage inherent in procedures in the time domain has so far been the fact, that they required a very large sized memory, the size of which increased in multiples if prosodic parameters were also taken into account. We developed a system which synthesizes an unlimited vocabulary with a memory of only 22 kBytes (8-Bit Bytes). Further investigations showed, that by means of the linear prediction method it is possible to control the fundamental frequency of the speech signal in a wide range without storing additional speech segments. In addition to this work, we developed a system to transform orthografic text into a phoneme string automatically. We optimized this algorithm for the 8000 most frequent words of the German language. The whole system which is implemented on a microprozessor, is placed on a single board, with a storage of total 32 kBytes (8-Bit Bytes).
  • Keywords
    Control system synthesis; Explosives; Frequency; Natural languages; Prediction methods; Signal analysis; Signal synthesis; Speech synthesis; Vocabulary; Vocoders;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '82.
  • Type

    conf

  • DOI
    10.1109/ICASSP.1982.1171863
  • Filename
    1171863