DocumentCode
3444126
Title
A multilingual TTS system with less than 1 Mbyte footprint for embedded applications
Author
Hoffmann, R. ; Jokisch, O. ; Hirschfeld, D. ; Strecha, G. ; Kruschke, H. ; Kordon, U. ; Koloska, U.
Author_Institution
Dresden Univ. of Technol., Germany
Volume
1
fYear
2003
fDate
6-10 April 2003
Abstract
Text-to-speech (TTS) systems have improved their quality to a large extent lately. This development has resulted in memory requirements of several megabytes that cannot be accepted in many applications, especially in embedded systems. Such applications are usually limited to a footprint of as much as 1 megabyte and require the processing power to be as low as possible. These requirements may be met if the text processing is changed from the usual data-driven algorithms to rule-based processing. Furthermore, the inventory (diphone inventory) should be as small as possible and should be stored in a compressed manner. This is demonstrated by a modified version of the Dresden speech synthesis system, DRESS, which is called microDRESS. Compared to the baseline system, microDRESS does not show essential quality losses apart from the influences of the telephone bandwidth which is appropriate for many embedded applications.
Keywords
embedded systems; speech processing; speech synthesis; text analysis; Dresden speech synthesis system; diphone inventory; embedded applications; multilingual TTS system; rule-based processing; telephone bandwidth; text processing; text-to-speech systems; Bandwidth; Data flow computing; Databases; Design optimization; Embedded system; Natural languages; Speech coding; Speech synthesis; Telephony; Text processing;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1198835
Filename
1198835
Link To Document