Title :
Toward machine translation with statistics and syntax and semantics
Author_Institution :
Dept. of Comput. Sci. & Eng., Hong Kong Univ. of Sci. & Technol., Hong Kong, China
fDate :
Nov. 13 2009-Dec. 17 2009
Abstract :
In this paper, we survey some central issues in the historical, current, and future landscape of statistical machine translation (SMT) research, taking as a starting point an extended three-dimensional MT model space. We posit a socio-geographical conceptual disparity hypothesis, that aims to explain why language pairs like Chinese-English have presented MT with so much more difficulty than others. The evolution from simple token-based to segment-based to tree-based syntactic SMT is sketched. For tree-based SMT, we consider language bias rationales for selecting the degree of compositional power within the hierarchy of expressiveness for transduction grammars (or synchronous grammars). This leads us to inversion transductions and the ITG model prevalent in current state-of-the-art SMT, along with the underlying ITG hypothesis, which posits a language universal. Against this backdrop, we enumerate a set of key open questions for syntactic SMT. We then consider the more recent area of semantic SMT. We list principles for successful application of sense disambiguation models to semantic SMT, and describe early directions in the use of semantic role labeling for semantic SMT.
Keywords :
computational linguistics; language translation; programming language semantics; statistics; SMT semantic; hierarchy transduction grammars; inversion transductions; semantics; sense disambiguation models; set key open; socio geographical conceptual; statistical machine translation; statistics; syntax; three dimensional MT model space; token based evolution; toward machine translation; tree based syntactic SMT; Computer science; Hardware; Humans; Labeling; Machine learning; Pattern recognition; Space technology; Speech recognition; Statistics; Surface-mount technology;
Conference_Titel :
Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
Conference_Location :
Merano
Print_ISBN :
978-1-4244-5478-5
Electronic_ISBN :
978-1-4244-5479-2
DOI :
10.1109/ASRU.2009.5373509