• DocumentCode
    130027
  • Title

    A model of the MT lexicon for verbs as an interface between syntactic parsing and semantic representations

  • Author

    Alam, Yukiko Sasaki

  • Author_Institution
    Dept. of Digital Media, Hosei Univ., Koganei, Japan
  • fYear
    2014
  • fDate
    28-30 July 2014
  • Firstpage
    382
  • Lastpage
    387
  • Abstract
    Machine translation (MT) deals with a large amount of data which represents sentence meanings constructed on the basis of the grammatical rules and the lexical knowledge of the source and target languages. As it is hardly possible to make an automated semantic analysis of the source language data, predominant methods in use have been transfer models such as those of phrase-based statistical MT and pattern-based MT, the main task of which is to replace the source language text with the target language text without understanding them. As it is reported that nearly half the text data consist of unanalyzable multiword like collocations, idioms, and phrasal verbs, it would be inevitable to resort to such a method. At the same time, however, it has been observed that MT output quality suffers to a significant extent when the sentences are not divided into proper segments. This paper proposes a model of the lexicon of verbs whose rich relational information contributes to parsing and understanding sentences. While other existing models of lexicons have no or little consideration for their roles of interaction with processes of syntactic parsing and semantic representation, the proposed model is designed for playing a role as an interface between the two processes in handling sentences with regular usages of verbs as well as idiomatic and metaphoric usages.
  • Keywords
    grammars; language translation; statistical analysis; text analysis; MT lexicon model; automated semantic analysis; grammatical rules; idiomatic usages; lexical knowledge; machine translation; metaphoric usages; pattern-based MT; phrase-based statistical MT; relational information; semantic representation; sentence meanings; source language data; source language text; syntactic parsing; target language text; text data; transfer models; Abstracts; Compounds; Context; Educational institutions; Natural language processing; Semantics; Syntactics; Lexicon for a parser; Lexicon of verbs; Machine translation; Metaphoric usages of verbs;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information and Automation (ICIA), 2014 IEEE International Conference on
  • Conference_Location
    Hailar
  • Type

    conf

  • DOI
    10.1109/ICInfA.2014.6932686
  • Filename
    6932686