• DocumentCode
    442045
  • Title

    Research on automatic acquisition of translation template based on error-driven learning method

  • Author

    Zhang, Chun-xiang ; Li, Sheng ; Zhao, Tie-jun ; Cao, Hai-Long

  • Author_Institution
    MOE-MS Key Lab. of Natural Language Process. & Speech, Harbin Inst. of Technol., China
  • Volume
    6
  • fYear
    2005
  • fDate
    18-21 Aug. 2005
  • Firstpage
    3723
  • Abstract
    Automatic acquisition of translation templates is important for MT system to improve its translation quality and its ability of adaptation to new domain. In this paper, translation equivalences are obtained from translation corresponding trees of bilingual sentence pairs. Error-driven learning method is employed to acquire templates from extracted equivalences. At the same time, optimization method based on automatic translation evaluation is used to clean these templates. Then they are applied to a transfer-based MT system, and "863" dialog corpus in 2003 is used for open test. Experimental results show that the performance of new acquired templates exceeds that of original ones. Combination of new acquired templates and original ones makes 5-gram Nist assessment score of open test corpus improve by 8.11%.
  • Keywords
    computational linguistics; equivalence classes; language translation; trees (mathematics); Nist assessment score; automatic translation evaluation; bilingual sentence pair; error-driven learning method; machine translation system; optimization; translation equivalence; translation template; Computer errors; Computer science; Laboratories; Learning systems; NIST; Natural language processing; Natural languages; Optimization methods; Speech processing; System testing; Nist assessment score; Translation template; automatic translation evaluation; error-driven learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics, 2005. Proceedings of 2005 International Conference on
  • Conference_Location
    Guangzhou, China
  • Print_ISBN
    0-7803-9091-1
  • Type

    conf

  • DOI
    10.1109/ICMLC.2005.1527588
  • Filename
    1527588