• DocumentCode
    2172715
  • Title

    A new alignment algorithm for parallel corpora of Japanese and Chinese

  • Author

    Quan, Yuhua ; Jin, Ying-hao ; Quan, Jingji

  • Author_Institution
    Dept. of Foreign Language, Tonghua Normal Univ., Tonghua, China
  • fYear
    2011
  • fDate
    9-11 Sept. 2011
  • Firstpage
    3498
  • Lastpage
    3501
  • Abstract
    A new method is presented to improve the efficiency of sentence alignment for parallel corpora. It builds a non-grammar relation between source and target sentences by Boolean transition, computes average match scores accurately from the Boolean transition of the sentences, and classifies all parallel sentences into 1:1, 1:n, n:1, and n:m types according to their alignment relations. The method not only increases alignment efficiency but also improves the accuracy of sentence alignment. Computer experiments show that the new method is more adaptable and practical.
  • Keywords
    Boolean functions; computational linguistics; natural language processing; pattern classification; text analysis; Boolean transition; Chinese corpora; Japanese corpora; alignment algorithm; alignment efficiency; alignment relation; match score; nongrammar relation; parallel corpora; parallel sentence classification; sentence alignment; source sentence; target sentence; Algorithm design and analysis; Computational linguistics; Computers; Educational institutions; Grammar; Research and development; Semantics; alignment; boolean transition; computational linguistics; parallel corpora; semantic;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Electronics, Communications and Control (ICECC), 2011 International Conference on
  • Conference_Location
    Ningbo
  • Print_ISBN
    978-1-4577-0320-1
  • Type

    conf

  • DOI
    10.1109/ICECC.2011.6066458
  • Filename
    6066458