DocumentCode :
3489925
Title :
A Systematic Comparison of SVM and Maximum Entropy Classifiers for Translation Error Detection
Author :
Jinhua Du ; Sha Wang
Author_Institution :
Sch. of Autom. & Inf. Eng., Xi´an Univ. of Technol., Xi´an, China
fYear :
2012
fDate :
13-15 Nov. 2012
Firstpage :
125
Lastpage :
128
Abstract :
In recent years, the translation error detection or confidence estimation for SMT has been becoming a hot question, especially in the localization industry. This paper mainly focuses on a systematic comparison on two different classifiers Maximum Entropy (MaxEnt) and SVM over different features to illustrate their error detection capabilities. Three typical word posterior probabilities (WPP) and three linguistic features are introduced and fairly compared over two classifiers on Chinese to-English NIST datasets. Experimental results show that the combination of WPP with linguistic features can significantly reduce the CER, and the SVM classifier outperforms the MaxEnt classifier in terms of the CER and F measure.
Keywords :
language translation; maximum entropy methods; natural language processing; pattern classification; probability; support vector machines; CER; Chinese to-English NIST datasets; F measure; MaxEnt; SM; SVM; WPP; confidence estimation; error detection capabilities; localization industry; maximum entropy classifiers; statistical machine translation; systematic comparison; translation error detection; word posterior probabilities; Estimation; Feature extraction; NIST; Pragmatics; Support vector machines; Syntactics; Systematics; Error Detection; Linguistic Features; Maximum Entropy Classifier; SVM classifier; Word Posterior Probability;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Asian Language Processing (IALP), 2012 International Conference on
Conference_Location :
Hanoi
Print_ISBN :
978-1-4673-6113-2
Electronic_ISBN :
978-0-7695-4886-9
Type :
conf
DOI :
10.1109/IALP.2012.20
Filename :
6473712
Link To Document :
بازگشت