DocumentCode
2172715
Title
A new alignment algorithm for parallel corpora of Japanese and Chinese
Author
Quan, Yuhua ; Jin, Ying-hao ; Quan, Jingji
Author_Institution
Dept. of Foreign Language, Tonghua Normal Univ., Tonghua, China
fYear
2011
fDate
9-11 Sept. 2011
Firstpage
3498
Lastpage
3501
Abstract
To improve the efficiency of sentence alignment for parallel corpora, a new method is presented. It builds a non-grammar relation between source and target sentences by Boolean transition, computes average match scores accurately from the Boolean transition of the sentences, and classifies all parallel sentences into 1:1, 1:n, n:1 and n:m alignment relations. The method not only increases alignment efficiency but also improves the accuracy of sentence alignment. Computer experiments show that the new method is more adaptable and practical.
Keywords
Boolean functions; computational linguistics; natural language processing; pattern classification; text analysis; Boolean transition; Chinese corpora; Japanese corpora; alignment algorithm; alignment efficiency; alignment relation; match score; nongrammar relation; parallel corpora; parallel sentence classification; sentence alignment; source sentence; target sentence; Algorithm design and analysis; Computational linguistics; Computers; Educational institutions; Grammar; Research and development; Semantics; alignment; boolean transition; computational linguistics; parallel corpora; semantic;
fLanguage
English
Publisher
IEEE
Conference_Titel
Electronics, Communications and Control (ICECC), 2011 International Conference on
Conference_Location
Ningbo
Print_ISBN
978-1-4577-0320-1
Type
conf
DOI
10.1109/ICECC.2011.6066458
Filename
6066458