DocumentCode :
3444126
Title :
A multilingual TTS system with less than 1 Mbyte footprint for embedded applications
Author :
Hoffmann, R. ; Jokisch, O. ; Hirschfeld, D. ; Strecha, G. ; Kruschke, H. ; Kordon, U. ; Koloska, U.
Author_Institution :
Dresden Univ. of Technol., Germany
Volume :
1
fYear :
2003
fDate :
6-10 April 2003
Abstract :
Text-to-speech (TTS) systems have improved their quality to a large extent lately. This development has resulted in memory requirements of several megabytes that cannot be accepted in many applications, especially in embedded systems. Such applications are usually limited to a footprint of as much as 1 megabyte and require the processing power to be as low as possible. These requirements may be met if the text processing is changed from the usual data-driven algorithms to rule-based processing. Furthermore, the inventory (diphone inventory) should be as small as possible and should be stored in a compressed manner. This is demonstrated by a modified version of the Dresden speech synthesis system, DRESS, which is called microDRESS. Compared to the baseline system, microDRESS does not show essential quality losses apart from the influences of the telephone bandwidth which is appropriate for many embedded applications.
Keywords :
embedded systems; speech processing; speech synthesis; text analysis; Dresden speech synthesis system; diphone inventory; embedded applications; multilingual TTS system; rule-based processing; telephone bandwidth; text processing; text-to-speech systems; Bandwidth; Data flow computing; Databases; Design optimization; Embedded system; Natural languages; Speech coding; Speech synthesis; Telephony; Text processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7663-3
Type :
conf
DOI :
10.1109/ICASSP.2003.1198835
Filename :
1198835
Link To Document :
بازگشت