DocumentCode :
2172715
Title :
A new alignment algorithm for parallel corpora of Japanese and Chinese
Author :
Quan, Yuhua ; Jin, Ying-hao ; Quan, Jingji
Author_Institution :
Dept. of Foreign Language, Tonghua Normal Univ., Tonghua, China
fYear :
2011
fDate :
9-11 Sept. 2011
Firstpage :
3498
Lastpage :
3501
Abstract :
A new method is presented to improve the efficiency of sentence alignment for parallel corpora. It builds the non-grammar relation between source and target sentences by Boolean transition, computes average match scores accurately from the Boolean transition of the sentences, and classifies all parallel sentences into 1:1, 1:n, n:1 and n:m types according to their alignment relations. The method improves both the efficiency and the accuracy of sentence alignment. Computer experiments show that the new method is more adaptable and practical.
Keywords :
Boolean functions; computational linguistics; natural language processing; pattern classification; text analysis; Boolean transition; Chinese corpora; Japanese corpora; alignment algorithm; alignment efficiency; alignment relation; match score; nongrammar relation; parallel corpora; parallel sentence classification; sentence alignment; source sentence; target sentence; Algorithm design and analysis; Computational linguistics; Computers; Educational institutions; Grammar; Research and development; Semantics; alignment; boolean transition; computational linguistics; parallel corpora; semantic;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Electronics, Communications and Control (ICECC), 2011 International Conference on
Conference_Location :
Ningbo
Print_ISBN :
978-1-4577-0320-1
Type :
conf
DOI :
10.1109/ICECC.2011.6066458
Filename :
6066458