Title :
A multilingual TTS system with less than 1 Mbyte footprint for embedded applications
Author :
Hoffmann, R. ; Jokisch, O. ; Hirschfeld, D. ; Strecha, G. ; Kruschke, H. ; Kordon, U. ; Koloska, U.
Author_Institution :
Dresden Univ. of Technol., Germany
Abstract :
Text-to-speech (TTS) systems have improved their quality to a large extent lately. This development has resulted in memory requirements of several megabytes that cannot be accepted in many applications, especially in embedded systems. Such applications are usually limited to a footprint of as much as 1 megabyte and require the processing power to be as low as possible. These requirements may be met if the text processing is changed from the usual data-driven algorithms to rule-based processing. Furthermore, the inventory (diphone inventory) should be as small as possible and should be stored in a compressed manner. This is demonstrated by a modified version of the Dresden speech synthesis system, DRESS, which is called microDRESS. Compared to the baseline system, microDRESS does not show essential quality losses apart from the influences of the telephone bandwidth which is appropriate for many embedded applications.
Keywords :
embedded systems; speech processing; speech synthesis; text analysis; Dresden speech synthesis system; diphone inventory; embedded applications; multilingual TTS system; rule-based processing; telephone bandwidth; text processing; text-to-speech systems; Bandwidth; Data flow computing; Databases; Design optimization; Embedded system; Natural languages; Speech coding; Speech synthesis; Telephony; Text processing;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
Print_ISBN :
0-7803-7663-3
DOI :
10.1109/ICASSP.2003.1198835