• DocumentCode
    677184
  • Title

    Parallel phrase extraction from English-Vietnamese parallel corpora

  • Author

    Quang-Hung Le ; Anh-Cuong Le ; Van-Nam Huynh

  • Author_Institution
    Fac. of Inf. Technol., Quynhon Univ., Quynhon, Vietnam
  • fYear
    2013
  • fDate
    10-13 Nov. 2013
  • Firstpage
    175
  • Lastpage
    179
  • Abstract
    Parallel phrases are important for some Natural Language Processing (NLP) tasks such as machine translation or cross language information retrieval. This paper proposes a novel method to extract parallel phrases from English-Vietnamese parallel corpora. In this method, we use predefined syntactic patterns and phrase translation probabilities for determining parallel phrases. The experiments are conducted on English-Vietnamese parallel corpora and have shown that our method increases 79:72% of Fscore for obtaining parallel phrases in comparison with a baseline.
  • Keywords
    language translation; natural language processing; probability; text analysis; English-Vietnamese parallel corpora; NLP tasks; cross language information retrieval; machine translation; natural language processing tasks; parallel phrase extraction; phrase translation probabilities; predefined syntactic patterns; Conferences; Equations; Mathematical model; Natural language processing; Pattern matching; Power capacitors; Syntactics; Parallel phrase; constrained word alignment model; parallel corpora; statistical measure; syntactic pattern;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computing and Communication Technologies, Research, Innovation, and Vision for the Future (RIVF), 2013 IEEE RIVF International Conference on
  • Conference_Location
    Hanoi
  • Print_ISBN
    978-1-4799-1349-7
  • Type

    conf

  • DOI
    10.1109/RIVF.2013.6719889
  • Filename
    6719889