DocumentCode :
2975724
Title :
Toward machine translation with statistics and syntax and semantics
Author :
Wu, Dekai
Author_Institution :
Dept. of Comput. Sci. & Eng., Hong Kong Univ. of Sci. & Technol., Hong Kong, China
fYear :
2009
fDate :
Nov. 13 2009-Dec. 17 2009
Firstpage :
12
Lastpage :
21
Abstract :
In this paper, we survey some central issues in the historical, current, and future landscape of statistical machine translation (SMT) research, taking as a starting point an extended three-dimensional MT model space. We posit a socio-geographical conceptual disparity hypothesis, that aims to explain why language pairs like Chinese-English have presented MT with so much more difficulty than others. The evolution from simple token-based to segment-based to tree-based syntactic SMT is sketched. For tree-based SMT, we consider language bias rationales for selecting the degree of compositional power within the hierarchy of expressiveness for transduction grammars (or synchronous grammars). This leads us to inversion transductions and the ITG model prevalent in current state-of-the-art SMT, along with the underlying ITG hypothesis, which posits a language universal. Against this backdrop, we enumerate a set of key open questions for syntactic SMT. We then consider the more recent area of semantic SMT. We list principles for successful application of sense disambiguation models to semantic SMT, and describe early directions in the use of semantic role labeling for semantic SMT.
Keywords :
computational linguistics; language translation; programming language semantics; statistics; SMT semantic; hierarchy transduction grammars; inversion transductions; semantics; sense disambiguation models; set key open; socio geographical conceptual; statistical machine translation; statistics; syntax; three dimensional MT model space; token based evolution; toward machine translation; tree based syntactic SMT; Computer science; Hardware; Humans; Labeling; Machine learning; Pattern recognition; Space technology; Speech recognition; Statistics; Surface-mount technology;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
Conference_Location :
Merano
Print_ISBN :
978-1-4244-5478-5
Electronic_ISBN :
978-1-4244-5479-2
Type :
conf
DOI :
10.1109/ASRU.2009.5373509
Filename :
5373509
Link To Document :
بازگشت