Title :
Integrating text and phonetic information for robust statistical speech translation
Author :
Liang Gu ; Yonggang Deng ; Wei Zhang ; Yuqing Gao
Author_Institution :
IBM T. J. Watson Res. Center, Yorktown Heights, NY
Abstract :
This paper focuses on the use of both text and phonetic information in a speech translation system in order to make translation results more robust to speech recognition errors. Conventional statistical speech translation formulas are extended to exploit both text-form and phonetic speech recognition results. A novel data-driven word/text tying algorithm is then proposed to group words based on both pronunciation similarity and meaning equivalency. In our speech-to-text translation experiments, significant improvement was achieved by using phonetic information and the proposed word tying algorithm.
Keywords :
language translation; speech recognition; text analysis; phonetic information; phonetic speech recognition; robust statistical speech translation; speech recognition errors; speech-to-text translation; text information; Artificial intelligence; Automatic speech recognition; Data mining; Design optimization; Erbium; Natural languages; Robustness; Speech recognition; Surface-mount technology; Training data;
Conference_Titel :
Spoken Language Technology Workshop, 2006. IEEE
Conference_Location :
Palm Beach
Print_ISBN :
1-4244-0872-5
DOI :
10.1109/SLT.2006.326804