Title :
Cross sentence alignment based on singular value decomposition
Author :
Ho, Anna ; Wong, Fai ; de Oliveveira, F. ; Li, Yi-ping
Author_Institution :
Fac. of Sci. & Technol., Univ. of Macau, Macao
Abstract :
This paper describes a way of performing alignment of Portuguese-Chinese bilingual pairs from the given two documents. Extracting the alignment pairs is a critical step for building Portuguese-Chinese bilingual corpus for Example Based Machine Translation Systems (EBMT). In short, the proposed alignment system performs four steps: break down the document into sentences level, score each pair of sentence by different features, apply Singular Value Decomposition on the results, extract the aligned pair base on a similarity function.
Keywords :
language translation; natural languages; singular value decomposition; text analysis; EBMT; Portuguese-Chinese bilingual corpus; bilingual document alignment; cross sentence alignment; example based machine translation system; similarity function; singular value decomposition; Cybernetics; Dictionaries; Government; Machine learning; Matrix decomposition; Natural languages; Noise measurement; Noise reduction; Singular value decomposition; Testing; Bilingual document alignment; Cross sentence alignment; Segmentation; Singular value decomposition;
Conference_Titel :
Machine Learning and Cybernetics, 2008 International Conference on
Conference_Location :
Kunming
Print_ISBN :
978-1-4244-2095-7
Electronic_ISBN :
978-1-4244-2096-4
DOI :
10.1109/ICMLC.2008.4620833