DocumentCode :
2963003
Title :
Speech Translation Statistical System Using Multimodal Sources of Knowledge
Author :
Tomás, Jesus ; Canovas, Alejandro ; Lloret, Jaime ; García, Miguel
Author_Institution :
Inst. de Investig. para la Gestion Integrada de Zonas Costeras, Univ. Politec. de Valencia, Gandia, Spain
fYear :
2010
fDate :
20-25 Sept. 2010
Firstpage :
5
Lastpage :
9
Abstract :
The synergic combination of different sources of knowledge is a key aspect in the development of modern statistical translators. The effect and implications of adding additional other-than-voice information in a voice translation system is described in this work. The additional information serves as the bases for the log-linear combination of several statistical models. A prototype that implements a real-time speech translation system from Spanish to English that is adapted to specific teaching-related environments is presented. In the scenario of analysis a teacher as speaker giving an educational class could use a real time translation system with foreign students. The teacher could add slides or class notes as additional reference to the voice translation system. Should notes be already translated into the destination language the system could have even more accuracy. We present the theoretical framework of the problem, summarize the overall architecture of the system, show how the system is enhanced with capabilities related to capturing the additional information; and finally present the initial performance results.
Keywords :
language translation; speech processing; statistical analysis; English; Spanish; foreign students; log-linear combination; modern statistical translators; multimodal sources; other-than-voice information; real time translation system; real-time speech translation system; speech translation statistical system; statistical models; voice translation system; Acoustics; Adaptation model; Hidden Markov models; Prototypes; Real time systems; Speech; Speech recognition; adaptation; component; pedagogical tool; speech recognition; speech translation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computing in the Global Information Technology (ICCGI), 2010 Fifth International Multi-Conference on
Conference_Location :
Valencia
Print_ISBN :
978-1-4244-8068-5
Electronic_ISBN :
978-0-7695-4181-5
Type :
conf
DOI :
10.1109/ICCGI.2010.26
Filename :
5628801
Link To Document :
بازگشت